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Abstract 

We study, both numerically and theoretically, the relationship between the random 
Lyapunov exponent of a family of area preserving diffeomorphisms of the 2-sphere 
and the mean of the Lyapunov exponents of the individual members. The motivation 
for this study is the hope that a rich enough family of diffeomorphisms will always 
have members with positive Lyapunov exponents, that is to say, positive entropy. At 
question is what sort of notion of richness would make such a conclusion valid. One 
type of richness of a family - invariance under the left action of SO(n + 1) - occurs 
naturally in the context of volume preserving diffeomorphisms of the n-sphere. Based 
on some positive results for families linear maps obtained by Dedieu and Shub, we 
investigate the exponents of such a family on the 2-sphere. Again motivated by the 
linear case, we investigate whether there is in fact a lower bound for the mean of the 
Lyapunov exponents in terms of the random exponents (with respect to the push- 
forward of Haar measure on SO (3)) in such a family. The family J- e that we study 
contains a twist map with stretching parameter e. 

In the family J- £ , we find strong numerical evidence for the existence of such a lower 
bound on mean Lyapunov exponents, when the values of the stretching parameter e 
are not too small. Even moderate values of e like e > 10 are enough to have an average 
of the metric entropy larger than that of the random map. For small e the estimated 
average entropy seems positive but is definitely much less than the one of the random 
map. The numerical evidence is in favor of the existence of exponentially small lower 
and upper bounds (in the present example, with an analytic family). 

Finally, the effect of a small randomization of fixed size 8 of the individual elements 
of the family T e is considered. Now the mean of the local random exponents of the 
family is indeed asymptotic to the random exponent of the entire family as e tends to 
infinity. 
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1 Introduction 



Numerical experiments with area-preserving surface diffeomorphisms often produce the 
following dynamical picture: elliptical islands floating in ergodic seas. A reasonable guess 
is that these ergodic seas typically have positive measure, and further, that the Lyapunov 
exponents on these seas are on average nonzero. An example of tiny elliptical islands in 
the context of differential equations can be found in JNSV ], where all rough numeric tests 
are in favor of ergodic behavior. 

In this paper, we add to the pile of experimental evidence in favor of this conjecture.]] 
We also discuss a possible theoretical approach to finding positive Lyapunov exponents in 
certain families of area-preserving diffeomorphisms of the sphere S 2 . The possibility of 
such an approach was discussed in [BuPuShWi]. The families we consider are not obtained 



from a specific set of equations, but from the following construction. Let / : S 2 — > S 2 
be an area-preserving diffeomorphism of the round sphere, and let SO (3) be the isometry 
group of S 2 . Let 

F = {g°f I )?£ SO(3)} 
be the left 50(3)-coset of / in Diff (S 2 ), and let v be the push-forward of Haar measure on 
SO (3) to J-. Provided that / is not itself an isometry, the family T has nonzero random 
Lyapunov exponents with respect to v (see Proposition [T^ below). The question this 
paper addresses is whether these random exponents can somehow be connected to the 
Lyapunov exponents of individual members of at least on u-average. 

To test whether there might be such a connection, we chose / to be a twist map, 
all of whose Lyapunov exponents are zero. The resulting family J- has similarities to 
the standard family on the 2-torus. The dynamics of the individual elements of T and 
how they depend on parameters is an interesting topic, but we only study here some key 
properties in the case of small e. We mainly focus on two quantities, the random exponent 
R{v) and the average exponent A(z/), which we now define. 

Let [i be Lebesgue measure on S 2 normalized to be a probability measure. Suppose for 
now that v is an arbitrary Borel probability measure supported on a subset T of Diff A1 (5 2 ), 
the space of /x-preserving diffeomorphisms of S 2 . For f £ J 7 , the largest Lyapunov of / at 
x G S 2 is found by computing the limit: 

lim -log\\T x f n \\ =\!(x,f), (1) 

n— >oo n 

which exists for /x-almost every x by the subadditive ergodic theorem. We define the 
average exponent of f to be 

Kf)= [ \i(f,x)dn(x), (2) 
Js 2 

and the average exponent of v to be 

A(u)= [ \{f)du{f). (3) 

! Due to space limitations, this version of the paper does not contain figures. A version 
with the figures is available at either http: //www. math. northwestern.edu/~wilkinso/papers 
(8| x 11 format) or http: //www. maia.ub.es/dsg/2002/index. html (A4 format). 
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Rather than iterate a single diffeomorphism / G we might choose instead a sequence 
of diffeomorphisms {fx, f%, . . .} C J- and form their composition: 

f (n) :=/ n o/ ft _ l0 ...o/ a . 

If the sequence is chosen to be independent and identically distributed with respect to v , 
then almost surely the limit 

lim ilog||T,/(")|| =: i?(x, (/^i/) (4) 

n— >oo n 

will exist, for /i-almost every (This too follows from the subadditive ergodic theorem, 
applied in the appropriate context). Further, the integral of R(x, v) with respect to 

[i is almost surely independent of the sequence (/t)x°. We define the random exponent of 
v to be this integral: 



R(u) = / R(x,(fi)?y)dfi(x). (5) 
is 2 

(see Kifer for an introduction to the subject of random diffeomorphisms and their expo- 
nents. We also give a self-contained introduction in section ^). The random exponent 
R{v) is usually positive, unless v is fairly degenerate Ca|. 



The quantity A(u) is mysterious from a computational perspective, but useful from a 
dynamical one. The quantity R{v) is relatively easy to estimate and is often positive. 

Our goal is to understand in general if there is a notion of richness for a probability 
measure u on the volume preserving diffeomorphisms of a closed manifold M such that 
the positivity of R{v) implies the positivity of A(u). Here we are investigating whether the 
SO(3) invariance of the measure v on Diff^S* 2 ) might provide such a notion of richness. 
In jBuPuShWj ] we asked if even more might hold, that we might bound A(z^) from below 
in terms of R{u). 

Question 1.1 Is there a positive constant C — perhaps 1 — such that A(i^) > CR{u)? 



Some motivation for Question 1.1 can be found in similar question for the iterates of 



linear maps (see |DeSh | where an affirmative answer to the analogue of Question |0] is 



proven with C = 1). In Section y, we describe a theoretical framework in which to address 



Question [O] and related questions. We discuss the linear case in Section ||. 

Returning to the specific family of diffeomorphisms mentioned earlier, we now describe 
the experiment in more detail. 

For e > 0, we define a one-parameter family of twist maps f e as follows. Express S 2 
as the sphere of radius 1/2 centered at (0, 0) in R x C, so that the coordinates (r, z) G S 2 
satisfy the equation 

\r\ 2 + \z\ 2 = 1/4. 

In these coordinates define a twist map f e : S 2 — > S 2 , for e > 0, by 

f £ (r,z) = (r,exp(27ri(r + l/2)e)z). 
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Let T e be the orbit SO(3)f £ . Let v be the push-forward of Haar measure on 50(3). 
We denote the resulting random and average Lyapunov exponents by R(e) and A(e), 
respectively. 

The numerical results are described in Section ^. To summarize these results, it appears 
that the inequality A(e) > R(e) is satisfied for large e, and it is definitely not satisfied for 
small e. We now have rigorous results to confirm some of these observations. The strongest 
of these results is proved in Section |^: for e close to 0, there is no C > satisfying the 



inequality in Question 1.1: in fact, we show in Corollary 7.7 that for small e, A(e) is less 



than e 3 ~ i?(e) 3//2 . The numerics support an upper bound on A(e) that is exponentially 



small, and we show in Theorem |7.6| that on most of T e this is indeed the case. On the 
other hand, A(e) does appear to be positive for positive e in all of the range where we 
can meaningfully compute. Section [7| also contains a study of the bifurcation structure 
of fixed points inside the family J- e . We include this analysis because it sheds light on 
where, in both T e and 5 2 , new elliptic periodic points and their surrounding islands are 
produced. Homoclinic bifurcations also give rise to horseshoes, which are associated (at 
least heuristically) with positive measure sets with nonzero exponents. 



For the case of large e, we show in Section^, that an inequality like that in Question 1.1 



is satisfied when a small amount of noise is introduced. More precisely, we first prove in 
Section || some results about the quantities R(e) and A(e) and a third quantity R(e,5), 
which measures the exponents of the "in-between" process in which each element of T e 
has added noise in a 5-ball inside T £ (see Section |3] for details). In particular, we prove 
that any element h of a family T = 50(3)/ described above will have average 5-diffused 
exponents R(h, u, 5) that are positive, unless / is an isometry. In addition, there exists a 
stationary measure rrih g f° r such a process on the projective bundle PS 2 that is absolutely 
continuous with respect to Liouville measure and projects to Lebesgue measure \i on 5 2 ; 
this measure is unique among stationary measures with these properties and is the unique 
fixed point of a "simple" linear operator. The integrated measure 

m s = J rn h> sdv(h) 

determines R(v,5), which is the average of the 5-diffused exponents R(h,u,5) over h £ 
T . Whenever mg is equal to Lebesgue measure to, we have the equality: R(u, 5) = 
A(z^). For the family !F e , denote by m £: $ this integrated measure. In Section 0, we prove 



(Theorem 8.1) that for any 5 > 0, lim^oom,^ = m. Using this result, we prove that if 



enough noise is introduced, then the inequality in Question 1.1 is satisfied as e —> oo; in 
particular, we show in Corollary ^2] that R(e, S) — R(s) tends to as 5 — > and e — » oo 
sufficiently quickly, for instance e > 5~ 25 . Results of a similar nature for the standard 
family were obtained by Carleson and Spencer |CaSp| and are described in Section || below. 

We suspect that a further study of these measures m$ would be interesting. Even 
for an 50(n)- or 5J7(n)-invariant family of matrices, the properties of the analogous "in- 
between" measures mg are, for the most part, unknown. In Section ||, we discuss what is 
known about these measures. For 50(2), we prove that rag is Lebesgue measure for all 
5 > 0, and for SU(n), m$ is not Lebesgue measure if 5 is sufficiently small. 

We thank Roy Adler, Victor Klepstyn, Yuri Kifer, Marco Martens, Lai-Sang Young 
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2 Background on random transformations and exponents 

In this section we introduce some notation and gather together some facts and propositions. 
What we have to say in this section and the next is standard and can be found for example 
in |Kil], [Ki2], |Ki3], [Li-Qi|, [Ca|, [GoMa] in most cases in greater generality. We have 



outlined proofs here in order to be reasonably self contained. 

If TC C Diff (M), and v is a probability measure on TC, then (TC, v) generates a random 
process given by selecting an independent, ^-distributed sequence {hi)\ C TC and forming 
the compositions: 

ft,( n ) = h n o h n ~x o ■ ■ ■ oh\. 

To study all possible outcomes of this experiment, we introduce the following auxiliary 
spaces and transformations: the shift space, TC°° := W^L^TL, the one-sided shift a : TC°° <^ 
given by: 

a(h 1 ,h 2 ,...) = (h2,h 3 ,...), 
and the skew product r : TC°° x M given by: 

T((h i )?,x) = (a((h i )?),h 1 (x)). 

Then a has a natural invariant measure v°° , the product measure induced by u, but a 
priori r has no preferred invariant measure. 

Definition 2.1 Let u be a probability measure on TC C Diff(M). A measure fi on M 
is stationary for the random process given by {Ti, v) if any of the following equivalent 
conditions is satisfied: 

1. x fi) = v°° x fi 

2. ev*(u X n) = n, where ev : TC x M — > M is the evaluation map: 

ev(h, x) = h{x) 

3. jU ★ v = fx, where * is the convolution operator defined by: 

H*i,(A) = [ iJi(hr l (A))dv(h), 
for every jjL-measurable subset A C M. 



H 
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Stationary measures always exist [ Kil | and are the random analogue of invariant mea- 
sures in the nonrandom setting. Part of the focus of this paper is to find natural stationary 
measures in the case where M = T\S 2 and TC and v are derived from Haar measure on 
SO(3). 

Given an injective linear map A : V — ► W between normed vector spaces we denote by 
A§ the induced map from the unit sphere in V to the unit sphere in W, which is defined 
by v — > i u|$j i | ■ We use the same notation for the induced map on the projective space PV. 
We denote the tangent bundle of S 2 by TS 2 , the unit tangent bundle by TiS 2 , and the 
projective bundle by PS 2 . We let m denote the normalized Liouville measure on PS 2 , so 
m is a probability measure which pushes forward under projection to S 2 to The fibers 
of TS 2 and PS 2 over a point z G S 2 are denoted by T Z S 2 and P Z S 2 . For any manifolds 
M, N and differentiable map F : M — > N the derivative of F at x G M is denoted by 
T X F; for v G T^M we will usually write ll TFv" instead of T x F(v). Finally, we denote by 
-Fjj : T\M — > T\N the map that covers F and is (T X F)$ on the fiber over x G M. Since the 
tangent map to g G SO (3) preserves unit tangent vectors, we will write u g" for g*. 

Now let / G Diff A1 (5 2 ), let T = {g o f \ g G SO (3)}, and let v be the push-forward to 
T of Haar measure on SO (3). Let m be normalized Liouville measure on PS 2 . Associated 
to T we then have the set 

f i = {h i \heF} = {gof i | g£SO(3)}, 

and the measure the push- forward to Diff (PS 2 ) of Haar measure on .SO (3). Let 
<7 : < — 3 , t : x 5^ < — 3 , <7jj : J^jj 00 <-^>, and r tf : x PS* 2 be the associated auxiliary 
transformations to the random processes generated by (J-, u) and (J~$,vy) respectively. 

Lemma 2.1 The measures \x and m are stationary for v and v$ respectively. 

The transformations r, a and crj are ergodic with respect to v°° x fi, u°° , and , 
respectively. 

Proof: It is straightforward to check that they are stationary. 
Ergodicity is not much harder to check. QED 

Now we can compute R{y) more explicitly: 

Proposition 2.2 Let f G Diff^S 2 ), let T = {g o / | g G SO (3)}, and let v be the 
push-forward to T of Haar measure on SO(3). Let m be normalized Liouville measure on 
PS 2 , the projective bundle of S 2 . Then 

R(y) = I log||T/t;||dm(u). 
Jps 2 

Moreover, R(v) > 0, unless f is an isometry. 

Proof: We first apply Birkhoff 's Ergodic Theorem to the measure-preserving transfor- 
mation Tjj : JFjj 00 x PS 2 <r^ and the function 

i,{{h^)?,v) = \og\\Tfv\\ 
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to obtain that 



lim -log||T/ip...r/n(«)|| = lim - y2\og\\Tf{h jV --h n (i 

>■ ■ \ li T) — -»00 J) ' * 



3=0 



=■■ L((hi t )?,v) 
exists a.e. in J 7 ? x PS 2 . The integral of this limit L is 



%)r,»)d(f xm) = / v«° x ™) 

Jf^xPS 2 

= / log||T/u||dm(t;). 
Jps 2 



'PS 2 

Next, we apply Oseledec's theorem to the map r : J-°° x S 2 ^ and the cocycle 
((hi)f,x) i-> T x h\. We obtain that for almost all (hi)f, almost all x G S 12 , and for almost 
all v G Ti^S* 2 , the limit 

K^)? ,*) := lim - log ||TV 

p^oo p 

exists, is independent of v, and has v°° x ^-integral equal to R{y). The function: 

^((W) == / K((h i )?,x)d f i(x) 
Js 2 

lim - log \\Th p . . . T x h\{u)\\ dud^{x) 
65 2 J ueT , x s 2 p-*°° P 



L 

/ lim -log\\Th p ...Th 1 (v)\\dm(v) 

J PS 2V^oop 



lp S 2 P^OO p 

is (7-invariant and has z/°°-integral equal to R{v); ergodicity of a implies that it is a.e. 
constant and therefore equal to R(y). We conclude that 



R(v) = [ \og\\Tf(v)\\dm(v) 
Jps 2 



It remains to see that f ps2 log ||T/u|| dm(v) > if / is not an isometry. For this we 
use the following elementary lemma. 

Lemma 2.3 Let A G SL(2, R). Then the Jacobian of with respect to Lebesgue measure 
on S 1 is given by: 

JaciA^v) = \\Av\\- 2 , 

for v G S 1 . 
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Since / preserves [a, it follows from this lemma that the Jacobian of with respect to 
m at v E T\S 2 is ||T/(t>)|| -2 . Since T/j is a diffeomorphism, 

/ \\Tf(v)\\- 2 dm(v) = [ Jac(f t )(y) 
Jps 2 JPS 2 

= 1. 

By Jensen's inequality, 

/ log||r/(t;)||- 2 dm(«) < log(/ \\Tf(v)\\- 2 dm(v)) 
JPS 2 JPS 2 

= 

with inequality holding unless log ||T/v|| is constant and equal to 0. Rearranging the 
inequality, we see that, unless / is an isometry, we must have J PS 2 log ||T/(u)|| dm(v) > 0. 
QED 



3 A theoretical framework 

3.1 Connecting R(s) to A(e) 

In this section, we attempt to interpolate between R(e) and A(e) via a third quantity, 
R(e,5), which we call the random 5-diffused exponent. When 5 is greater than or equal to 
the radius of SO (3), R(s, 5) is equal to R(e); as 5 approaches 0, R(e, 5) approaches (in lim- 
sup) a lower bound for A(e). Roughly speaking, R(e, 5) is the exponent (averaged over 
obtained by introducing random perturbations (viewed as noise) of order 5 to each element 
of J- £ , staying within the family J- e . In Lemma |3.2| we show that limsup^ ^( e i ^) — ^-( e )- 
On the other hand, we derive in Proposition [D^ a formula for R(e, 5): 

R(e,5) = [ log\\Tfv\\dm E 4v). 
Jps 2 

The probability measure m £ ^ in this formula has nice properties: it projects to Lebesgue 
measure \i on S 2 , and is absolutely continuous with respect to m, with smooth density. 
Now, recall (Proposition |2.2|) that 

R(s) = / log || Tfv || dm(v). 
Jps 2 

If it were the case that m £t s — ► m as 5 — > 0, then it would follow that: 

A(e) > lim sup R(e, S) 

= limsup / log\\Tfv\\dm e s(v) 
8^o Jps 2 

= / log dm(v) 
Jps 2 

= R(e). 
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Hence the properties of this measure m £ s are potentially quite interesting with regard to 
Question 1.1 . 

Here we collect some properties of m e g and R(e,8). First of all, R(e,5) is always 
positive for 5 > (in fact, we prove in Corollary |3,4| that this is true not just on average, 
but for individual elements of T £ ). In other words, introducing noise (no matter how 
small) to an element h G T £ invariably produces positive exponents. 

The measure m £ .5 has additional properties as well. We prove that we can write: 

m £ ,s = / m g,e,8 dg, 
JSO(3) 

where, for each g £ SO (3), mg E $ is the unique probability measure on PS 2 with the 
properties: 

1- m g,s,8 is stationary for the (5-diffused process about gf £ ; 

2. rrig )£ ^ is absolutely continuous with respect to m, with smooth density; 

3. mg t£j s projects to Lebesgue measure /1 on 5 2 . 

As part of the proof, we show that each measure m g , £t s is the unique fixed point of a 
"simple" linear operator. 

Finally, from the way m £ ^ is constructed, it follows that m £ ^ shares all of the sym- 
metries of f £ . In particular, the density ip £ ^ is invariant under all rotations that fix the 
North pole. The further study of these measures m- 9 , £ ,<5 and m £: $ might be of independent 
interest. We discuss the linear version of m £: s in Section [|. In Section ||, we examine the 
behavior of m £ ^ as e — > 00. 

We now turn to the proofs of assertions 1.-3.. We first prove a standard semicontinuity 
result for random exponents. 

Lemma 3.1 Let {7^} be a sequence of probability measures on Ti C Diff(M) that con- 
verges weakly to a probability measure 7. Suppose that [i is stationary for the random 
process generated by (H, 7i), for every i, (for example, if Tic DiffAM)). Then 

limsupi?(7i) < R{pi). 



Proof: Let a n ((hi)f , x) = log \\T x h^ ||. Then a n : H°° x M -> R is subadditive with 
respect to r. By the subadditive ergodic theorem it then follows that 

Rfa) = [ lim —a n d(j°° x /i) 

= lim — / a n d(7°° x /i) 

n-oon J H °°xM 

= inf— / a n d(7°° x n). 
n n Jn°°xM 
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Now for any fixed n we have 



limsup-R(7j) < limsup— j a n d(pLX~fl 



So 



limsupi?(7j) < inf— j a n d(fiXj°°) 
H^l n n Jh°°xM 

= R(i)- 

QED 

We will apply this lemma to the situation where ji is supported on a small ball in T e 
converging, as i — ► oo, to a Dirac measure supported on an element of T £ . 

Let 5 > 0, and let Us be a symmetric S ball around the identity in 50(3). Give Us 
the restriction of Haar measure, normalized to be a probability measure and similarly for 
Fg,e,5 '■= Usgfe, for every g G SO (3) and e > 0. We denote this last measure by v g , e ,5- Let 
R(g,£,d) = R{v g , £ ,s)- 

Definition 3.1 The (5-) diffused random exponent is the average over SO (3) of R(g, e, 5): 

R(e,S) = f R(g,e,5)dv(g). 

Jg&SO{3) 

Lemma 3.2 



limsup / R(g,e,5) dg < A(e). 

8^o Jnesom 



8^0 Jg£SO{3) 

Proof: Note that lim^o u g,e,S = $gf s , Dirac measure supported on gf £ . By the previous 
lemma, 

limsup R(g, e, 5) = lim sup R(v gj£t s) 
8^0 8^0 

= Kofe) 

for each g 6 50(3), so the same is true for the integral. QED 

Now let ho : S 2 — > S 2 be any //-preserving diffeomorphism, and let H = Us ho. As in 
the previous section, define the space 

^tt = ih I h e H}, 

and evaluation maps 

ev : H x S 2 -► S 2 , ev$ : H x PS 2 -► PS 2 . 

Let vs and z^jj be the push- forwards of normalized Haar measure to H and 7Yy , respectively. 
Note that \i is stationary for the process generated by (H, vs). 
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Proposition 3.3 // ho is not an isometry, then for fixed 5 > the random process on 
PS 2 generated by ("Hfj, z^jj) has a stationary measure m$ that is absolutely continuous with 
smooth density, covers fi, and is the unique such stationary measure. 
Moreover, 

R{y s ) = [ log \\Th v\\dm s (v) > 
Jps 2 

Corollary 3.4 For fixed 6 > 0, e ^ and g £ SO(3), the random process on PS 2 
generated by (J- g , £ ,6, v g,e,s) has a stationary measure rn g ^ £ ^ that is absolutely continuous 
with smooth density, covers [i, and is the unique such stationary measure. 
Moreover, 

R(g, £,5) = / log ||T/ e u|| dm g . £ j > 
J PS 2 

Proof of Corollary |3.4| : Apply Proposition [T^ to the case where ho = gf e . 

Proof of Proposition |3.3| : Let ho and 5 > be given. We break the proof into steps. 

Step 1: Construction of mj. 

Recall that the convolution of a probability measure ir on and a probability measure 
m on PS 2 is a probability measure ir * m on PS 2 defined by 

ir-km{E)= / m(h7 1 E)d'K{hy) 

M 

for every m- measurable E C PS 2 . That a measure m is stationary for the measure tt 
is equivalent to the fact that ir^ * m = m. For k > 1 we let 7r fc * m = it * (-7r fc_1 * m). 
For any probability measure m on PS 2 any weak limit of the Cesaro sums ^ Yli K k * m 
is a stationary measure for tt. Beginning with a measure m which pushes forward under 
projection to /j, produces an invariant measure by this process with the same property. If 
we start with m as Liouville measure on PS 2 and tt = we call this limiting measure 
m s . 

Step 2: ms is absolutely continuous, with smooth density 

For any measurable set A C PS 2 , we have: 

m 5 {A) = (v s $ x m s )evQ 1 (A) 

= ( i/iji x ms){(h$,v)\h$(v) £ A} 

^{fyl I h( v ) G A } dm 5 (v) 

Now if the Liouville measure m(A) equals zero, then vs{h$\h$(v) £ A} must also be 
zero, for every v S PS 2 . Thus ms(A) is zero. It follows that ms(A) is absolutely continuous 
with respect to m. So there is a non-negative integrable function cpg defined on PS 2 so 
that for any measurable A C PS 2 , 

mg(A) = / tps(x)dm(x). 
J A 




11 



In Lemma 3.6 we will prove that <ps satisfies the following formula: 



m{B(x,d)) Jh- 1 B(x,S) 



1 



m(B(x,5)) J B {x,5) 



Of 

c Ps(h^z)3ac(h^)(z) dm(z). 



It follows now fairly directly that <ps is as smooth as ho, since the average over a <5-ball of 
an L 1 function is continuous, and of a C k function, is C k+1 . 

Step 3: R{ys) satisfies the integral formula, and the exponents of m$ are 
nonzero. 

The argument that R{i>s) = fps 2 1°§ H^^-cHI dms is now the same as in the proof of 



proposition |2.2| , using Birkhoff's and Oseledec's theorems, where T$ is replaced by TL^, v 
by us and m by mj. 

Next we will prove that the largest exponent is positive and from that we will deduce 
uniqueness. As in Proposition 2.2 , for any h = gho £ 7i the Jacobian of h* with respect 



to ms at the vector v in T\S is 

p(hv) = ^(V) = ^(gVj f6) 

^ ' 1 \\Thv\\*Mv) \\Thov\\*<p s (v) { ' 

provided that (pg(v) / 0. 

We claim that the function p(h, v) cannot be v$ x ms - almost everywhere equal to 1. 
Suppose for the sake of contradiction that p(h, v) = 1 a.e. Since <pg is continuous, if we 
fix ms-a.e. v, then the function p(-,v) is continuous. Thus, for m^-a.e. v, we must have 
p(h,v) = 1 for every h £ Us- 

Next, notice in the expression (H) for p(h, v), that the only term that depends on g G U$ 
is the numerator ips{gh^v). Rewriting this expression, we have, for almost every v in the 
support of ips, 

Vsigh^v) = \\Th v\\ 2 <ps{v), (7) 

for every g £ Us- Since ms projects to p, and (fs is continuous, we have that for p-a.e. 
x G S 2 , the set O = {v \ <ps{v) > 0} is an open, /^-invariant set in PS 2 that intersects 
almost every fiber. Equation (0) implies (fs must be constant on connected components 
of O, since varying g inside of Us, the vector gh^v covers an open neighborhood of h^v 
in PS 2 . 

But, again by equation (|7|), on each such component of O, we must have that ||T7io|| 
is constant. Since O intersects almost every fiber of PS 2 , we obtain that for almost every 
x € S 2 , there exists a connected open set I x in the fiber of PS 2 on which T x ho has constant 
norm. Since ho preserves area, we must have HX^/ioH = 1 for p-a.e. x, contradicting the 
assumption that ho is not an isometry. 

So p(h,v) is not a.e. equal to 1, and by Jensen's inequality, we have: 

log p(h, v) dvs(h) dms (v) < log / p(h,v) dvs(h) dms(v) 
HxPS 2 JhxPS 2 

= 0. 
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But 



R{y s ) = [ log \\Th v\\dm s {v) 
J PS 2 

log 1 1 Thv 1 1 ~ 2 dv 5 {h) dm 5 {v) 

2 JhxPS 2 V WW / 

log p(h, v) dus(h) dms(v) 



HxPS 2 

/ „ tr>:( h a (llW \ 

du^(h) dms(v), 



lb 



IHxPS 2 
> 0. 

(Here we used the stationarity of ms to conclude that the integral of log (ips(h$(v)) / ips(v)) 
is 0). 

Part 4: ms(A) is unique 

It remains to prove that the measure ms is unique among absolutely continuous sta- 
tionary measures which cover /x, now that we know that the random Lyapunov exponents 
are not zero. 

Let 7 be any such measure. Then, as for m$, there is a nonnegative smooth function 
if) such that for any measurable A C PS 2 , 



J (A) = / if}(v)dm(v). 
J A 

Let 7^ be the disintegration of 7 on the fiber Ti^ x S 2 . The density of j x with respect to 
Lebesgue measure on the fiber T\^ X S 2 is the restriction of ip to the fiber. 

We need the notion of a natural extension of a non-invertible transformation. Let 
($l,A,p) be a probability space with p-preserving transformation T : Q — ► Q. Let 

O = {(■ • • , (jj-i,u ) £ | T(u;_j) = Vi > 1}. 

Let T : Q be the map: 

T(- ■ ■ , cj_i, uj ) = (• • • , w , r(cj )), 
and let 7r : r2 — ► f2 be the projection onto the first factor: 

7r (- • • , w-i, w ) = ^o- 

Let „4 be the smallest c-algebra on f2 so that 7To and T are both measurable. On (Cl,A), 
there is a unique probability measure p, invariant under T, that pushes forward under 
7T to p. The measure-preserving system (fi,.A,p) ^ T is called the natural extension of 
(£l,A,p) T. The natural extension T is invertible, and ergodic if T is ergodic. 

Let r and be the associated auxiliary maps to the processes generated by (H, v$) 
and (7ijj, z^), respectively. The natural extension of rj with respect to the measure i/ag x 7 
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fibers over the natural extension of r with respect to uf x fx; the fiber over ((hi) ( ^' O0 , x) is 
Ti iX S 2 . For z>°°-almost every h = (hi)^^ and almost every fiber T\ jX S 2 there is a measure 
7h,x which is the disintegration of u°° x 7 along the fiber. Note that the extension v°° x 7 
is determined by this system of measures, and therefore so is 7. 

By (|LTQ|, Proposition 1.1, p.131), if (h = (fcO^.a?) 6 x S 2 = H™~x~S 2 , then 



7 X is the limit of the push- forwards: 

%,x = lm^(h_ ni ...h u )^ h -^ h -i (x) ) (8) 

Lemma 3.5 The limit ^) does not depend on the initial choice of absolutely continuous 
stationary measure 7 covering fi. 

Proof: The average exponents of r are nonzero, so the average exponents of f are also 
nonzero. It is easy to see that r is ergodic, and thus, so is f and the exponents of f are in 
fact nonzero /i-a.e. 

Let u(h,x) £ PS 2 be the unstable Lyapunov direction for f over (h, x). For every 
e > 0, there exists an n > 0, and a set G C T~l°? 00 x S 2 such that 

. v^{G) > 1 - 6 

• for every (h, x) £ G, the 7h,z-measure of an e-neighborhood of u(h, x) in Ti ;X S 2 is 
at least 1 — e. 

It follows that the limit in (||) is concentrated on the point u(h, x). QED 

Thus the natural extension of r and 7 is the same as the natural extension of r and 
ms, so r and ms are themselves equal. QED 



The next lemma completes the proof of Proposition 3.3 



Lemma 3.6 Let 7 be an absolutely continuous measure with respect to m on PS 2 that is 
stationary for vg. Then the density function ip defining 7 satisfies 

= iJf aw / 4>{y)dm{y). 

Proof: For any measurable set A C PS 2 , we have: 
ip(y)dm(y) = 7(A) 

A 

-1 



v s x 7 (e^ (A)) 
V5Xl{(h,y) £H t x PS 2 I h{y)eA} 
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i>(y)dm(y) du s (g) 

gdU s Jh m 1 g- 1 A 

^( h m9~ l {x))J{hQ}g~ l {x)Y l dm{x)dv s {g) 

'!-_( J A 

^( h ol9~ l {x))J{h^g~ l {x)y l dv 5 {g) dm(x) 



${x)= I ^(h-}g- l (x)J{h-}g- l (x))- l dv 5 (g). 



So, 

Since the integrand only depends on the point x E PS 2 we push the measure ug forward 
to PS 2 , use that Jig) -1 = 1 and that Haar measure on SO (3) pushes forward to Liouville 
measure on PS 2 to obtain 

Finally, changing variables one more time, gives: 

^( x ) = tJt — Fa / i>(y)dm(y). 



m(B{x,8)) J h -i B (x,8) 



QED 

This completes the proof of Proposition |3.3| 



Returning to discussion of the family J- E , we have verified that properties 1.-3. hold 
for the measures m g e $. Now let 



Since 



m £>s = / ra fl)E)( $ dg. 

>g£SO(3) 



/ R(g,e,5)dg = log \\Tfv\\ dm g ^ s (v) dg, 
JSO(S) JPS 2 



'SO(3) JPS 2 
it follows from Corollary |3.4| and Lemma [O that: 



Proposition 3.7 

> limsup / log \\Tf £ (x)v \\ dm Ei s 
<5^o JPS 2 

3.2 What is this measure m e < 5? 

For the family of twist maps J- £ under consideration, we prove in Section that the 



answer to Question LI is "no", and for small e, we have A(e) < R(e). It then follows 



from Proposition 3.7 that for small e, Lebesgue measure m is not a weak limit of m £t s 



as 5 — > 0. At the opposite extreme, we show in Section H that as e tends to infinity, the 
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measures m £) s do approach Lebesgue measure, for 5 > fixed. We hope that a future 
experiment will reveal more precisely how these measures behave in 5, for moderate values 
of e. As will be seen in section [6L the tiny differences between A(e) and R{e) for moderate 
and large values of s, are expected to give rise to numerical difficulties in estimating the 
behavior with respect to 5. 

In Section |], we show that in S*0(2)-invariant families of 2 x 2 matrices, if ms is the 
analogous "in-between" measure, averaged over the family, then ms is Lebesgue measure 
on S , for all 5 > 0. On the other hand, we also show that for unitarily-invariant families, as 
5 — > 0, the ms do not approach the natural unitarily invariant measure on the appropriate 
Grassmannian manifold, but instead they limit on an even "better" measure, in the sense 
that this measure forces the (strict) inequality A > R. We describe the construction of 
ms for matrices in Section [|]. 



4 The linear case 



Question 1.1 was originally motivated by a result of Dedieu-Shub about random and 
deterministic exponents for families of matrices. In this section, we describe these results 
and apply the framework of the previous section to the matrix setting. 

Let Li be a sequence of linear maps mapping finite dimensional normed vector spaces 
Vi to Vi+i for i £ N, Let v £ Vb\{0}. If the limit lim | log . . . Lo(v)|| exists it is 

called a Lyapunov exponent of the sequence. It is easy to see that if two vectors have the 
same exponent then so does every vector in the space spanned by them. It follows that 
there are at most dim(Vo) exponents. We denote them A,- where j < k < dim(Vo). We 
order the Aj so that Aj > Aj+i. 

Given a probability measure [i on GL(n,C), the space of invertible n x n complex 
matrices, we may form infinite sequences of elements chosen at random from fj, by taking 
the product measure on GL(n, C) N . Thus we may also talk about the Lyapunov exponents 
of sequences or almost all sequences in GL(n, C) N . 

For measures jx on GL(n,C) satisfying a mild integrability condition, we have, by 
Oseledec's Theorem, n Lyapunov exponents r\ > r<i > . . . > r n > -co such that, for 
almost every sequence . . . . . . g\ G GL(n, C), the limit 

lim-log \\g k . ..giv\\ 

exists for every v £ C n \ {0} and equals one of the fj, i = 1 . . . n, see Gol'dsheid and Mar- 



gulis iGoMaj or Ruelle (Ry| or Oseledec fO|. We may call the numbers t\ , . . . , v n random 
Lyapunov exponents or even just random exponents. If the measure is concentrated on a 
point A, these numbers: 

\i(A) = lim-log||^4 n u||, i = l...n, 
n 

are log \ e\\, . . . , log |e n |, where ei{A) = ej, i = 1 . . . n, are the eigenvalues of A written 
with multiplicity and \e\\ > | e2 1 > • • • > |e n |- 
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The integrability condition for Oseledec's Theorem is 

g G GL(n, C) — > log + (||<?||) is \i — integrable 

where for a real valued function /, / + = max[0, /]. Here we will assume more so that all 
our integrals are defined and finite, namely: 

(*) g G GL(n,C) — > log + (||<?||) and log + (||<7 _1 ||) are /i— integrable. 

In this matrix setting, there are rigorous lower bounds for the average exponents (= 
logarithms of moduli of eigenvalues) of unitarily-invariant families in GL(n, C). In [ DeSb| ], 
the following bound is proved: 

Theorem 4.1 [ DeSh| ] If fi is a unitarily invariant measure on GL(n,C) satisfying (*) 
then, for k = 1, . . . , n, 



/ vio g |Ai(A)|d M (A)>y>. 

' AeGL(n.C) J=1 J=1 



By unitary invariance we mean (i(U(X)) = fi(X) for all unitary transformations U G 
U(n, C) and all fi- measurable X C GL(n, C). 

We can rephrase a special case of this theorem in a form similar to Question 1.1. Fix 
A G GL(n, C). As above, let v be normalized Haar measure on U(n, C), and also denote 
by v the push-forward of v to the coset U(n, C)A C GL(n, C). Let iJ(-A) = be the 

largest random exponent of u, and let 



A(A) = [ log\ ei (B)\du(B). 

JB£U{n,C)A 



Then we have: 



Corollary 4.2 fDeShf For n>2, and for any A G GL(n, C), 

A(A) > R(A). 

Equality holds if and only if A £ U(n, C). 

Thus non-zero Lyapunov exponents for the family, i.e. non-zero random exponents, 
implies that at least some of the individual linear maps have non-zero exponents, i.e eigen- 
values of modulus not equal to 1. Hence the question that we posed for diffeomorphisms 
has a positive answer for sufficiently rich (i.e. unitarily-invariant) families of matrices. 



Remark 4.3 Theorem 4-1 is not true for general measures on GL(n,C) or GL(n,H) 
even for n = 2. Consider 

( 1 A a /ii 



v 1 1 J ' * V 1 

and give probability 1/2 to each. The left hand integral is zero but as is easily seen the 
right hand sum is positive. So, in this case the inequality goes the other way. We do not 
know a characterization of measures which make Theorem 4-1 valid. 
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We expect similar results for orthogonally invariant probability measures on GL(n, R) 
but the only case in which such a result has been proved is in dimension 2, where we have: 



Theorem 4.4 [ DcShj| Let fi be a probability measure on GL(2,R) satisfying 



g £ GL(2, R) — > log + (||g||) and log + (||g 1 ||) are fi — integrable. 

a. If fi is a SO(2,~R) invariant measure on GL + (2,R) then, 

log \\ 1 (A)\dn(A) = / / log ||Ac|| dx dfj,(A). 

AeGL+(2,~R) J A&GL+ (2,R) JxeS 1 

b. If fj, is a SO(2, R) invariant measure on GL~(2,R) ; whose support is not contained in 
RO(2,R) i.e. in the set of scalar multiples of orthogonal matrices, then 



L 



L 



]og\\i(A)\dii(A)> f [ log \\Ax\\dxdfi(A). 

JA£GL-(2,n) JxeS 1 



AeGL-(2,R) JAeGL~(2,n) JxdS 1 

Here GL + (2,R) (resp. GL~(2, R)) is the set of invertible matrices with positive 
(resp. negative) determinant. To rephrase Theorem 3 a., fix A £ GL + (2,R), and 
let v be the push forward of Haar measure on S'0(2,R) to the coset SO{2, R)A Let 
R(A) be the largest random exponent for the process induced by v, and let A(A) = 
/seSO(2,R)A lo § l e i(^)l dv(A). The we have: 

Corollary 4.5 |PeSh| , |AvBc] 1 For any A £ GL+(2,R), 

A(A) = R(A). 

We give an alternate proof of Corollary [4,5| in the following subsection. 
4.1 ms for matrices 

Let A £ GL(n,C) or GL(n, R) and /i be the Haar measure on U(n,C) or SO(n, R), 
respectively, normalized to be a probability measure. Let G denote GL(n, C) or GL(n, R). 
As we did for families of diffeomorphisms in Section 3, we now interpolate between random 
products and deterministic powers of matrices by changing /x. Let 5 > and G$ be the 
^-neighborhood of the identity in G. For g E G, GsgA is a neighborhood of gA in GA 
We normalize Haar measure restricted to Gs and push it forward to GsgA. Let us call 
this measure fig,g- Let ri(S, g) be the largest random exponent for this measure. At the 
end of this subsection, we prove: 

Proposition 4.6 lim^o ri (S, g) = log\ei(gA)\. 

Let G nt k(C) denote the Grassmannian manifold of k dimensional vector subspaces in 
C n , and let m be the natural unitarily invariant probability measure on G n ^(C). Any 
n x n complex matrix acts on the homogeneous space G n ^ by left-multiplication. For 
A £ GL(n, C) and P £ G ri ^{C), denote by A\P the restriction of A to the subspace P. 
Now let ms,g be the stationary measure on the Grassmannian G ri: i(= CP"""" 1 ) induced by 
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Proposition 4.7 n(5,g) = J PeGnl log ||(A|P)|| dms, g 

Let m$ = f geG vs,gdfJ-- It follows that: 
Proposition 4.8 J geG X 1 (gA)dfi = lim s ^ J PeGn l log ||(^4|P)|| dm s . 

Now 

rx= I \og\\(A\P)\\dm{P). 

So a comparison of f geG Xi(gA)d[i and r\ can be achieved via an understanding of the 
relationship between m and m$. We have two results in this direction. 

First, recall that inequality in Corollary |4.2| is strict when n > 2 unless A is an isometry. 
So for G = SU(n) the measures m$ favor the expanding directions of A as 5 — > 0. By 
Proposition |4.8|, we obtain the immediate corollary: 



Corollary 4.9 For G = SU(n), n>2, 



lim ms 7^ m, 

5^0 



unless A is an isometry. 



Experimentally the same seems to hold for SO(n) when n > 2, but we have not checked 
this very carefully. 

By contrast, the equality in Corollary 4.5 is consistent with lim^o m 6 = m - In fact, 
more is true: 

Theorem 4.10 For G = SO{2), 

m$ = m, 

for all 5 > 0. 

Combined with Proposition 4.5, Theorem 4.10| gives another proof of Corollary 4.5. 



Proof of Proposition 4.6: Following the proof of Lemma p.2| , one obtains that 

log | ei (gA) | > limr^g). 

S— >0 

If A is replaced by cA, for c G C \ {0}, then both sides of the equality change by log |c| . 
So we may assume that | det = 1, and it will be enough to prove that log |ei(<7A)| < 
lim^o^i(^fl') under the hypothesis that |ei(gvl)| > 1. 

Let E C C n be the generalized eigenspace of the eigenvalues of gA whose modulus 
equals |ei(gA)|. Then given e > 0, there is a metric on C n , a closed cone K C C n 
containing E in its interior, and a 5 > such that, for any B S GL(n,C) in the 
neighborhood of gA, we have: 

1. B(K) C K, and 
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2. \\Bv\\ > (\ei(gA)\ - e)\\v\\, for all v G K. 
It follows that, for any e > 0, there is a 5 > such that, 

\\B n ---B l \\ > (| ei (^)|- £ ) n 
for all sequences B\, ... , B n in U$gA. Hence 

n(6,g) > log(|ei( 5 A)| -e), 

and so 



lim r 1 (5,g) > log|ei(go)|. 



QED 



We now turn to the proof of Theorem 4.1C . 



Proof of Theorem |4.10| : By an argument presented in the proof of Proposition |4.q , we 
may assume that det A = 1. The projective action of SL(2, R) on RP 1 is conjugate to the 
standard action on the circle S 1 = {z S C | \z\ = 1} by linear fractional transformations. 
The conjugacy sends the rotation by 9 to rotation by 26. Let / : S 1 — > S 1 be the linear 
fractional transformation induced by A, and let T = {af \ a G S* 1 }. Let 

U a , 5 = {f3af\ arg(/9) E (-6,5)}, 

and let be normalized Lebesgue measure on U a> s, pushed forward from (—5, S). Denote 
by m Q s the stationary measure on S 1 induced by v a $. We will show that: 



m$ = / m a s da 
Js 1 

is Lebesgue measure on S . 

The same argument as in the proof of Lemma shows that for every a E S 1 , 



where 



dm a s(z) = ip at s(z)dz, 



fa,s{z) = 77? / <P a ,s(y)dy, 

16 Jye(af)-^B(z,8) 

and B(z,5) = {(3z \ arg(/3) G (-5,5)}. Setting 



we have that 



k s(z) = 7^1 B (l,6)(z), 



<Pa,5{z) = I k 5 (azf(y))tp a j(y)dy, 
lyes 1 



where we use z to denote the multiplicative inverse of z G S . Note that f sl kg(z) dz = 1. 
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Consider the following, more general setting. Let k be a non-negative function on 
S 1 such that J k = 1 and J k 2 < oo (all integrals are with respect to the normalized 
Lebesgue measure on S 1 ). For example, k = k$. Let / : S 1 i— > S* 1 be a fractional linear 
transformation, so that we can write, for \z\ = 1: 

n>0 

For aeS 1 , define the operator L a j on real functions on S 1 by: 

L a j<p(z) = J Hazf(y))<p{y)dy. 

The operator L a j is a positive operator, j L a jip = J tp. There exists a unique function 
ip a satisfying L a jip a = ip a and J <p a = 1. The function 99 Q is positive and continuous, 
upper and positive lower bounds for ip a can be chosen uniformly in a. In the case k = ks, 
we have ip a = (p a j- 

Proposition 4.11 We have, for all z G S 1 , f (p a (z)da = 1. 

The proposition follows directly from the following two claims: 
Claim 1. For all m > 0, all z € 5 1 , / L™ f l{z)da = 1. 

Claim 2. The sequence ^ J]m=i converges to <^ Q (£) in L l (du,dz) as n — > 00. 

Claim 2 follows from the ergodic theorem for Markov (i.e. LI = 1) operators applied 
to the operator ip(a,z) — > — — L„ f(ip a ip(a, .))(z) and the initial function — — . 

In order to prove Claim 1, we compute, for a function 95 G L 2 , ip(z) = J2n^z n , the 
Fourier coefficients 7^ of the function L a jip. We find, for n > 0: 

in = j z n L aJ ^(z)dz 

= J z a k(azf(y))Lp(y)dydz 

= a n k(-n) J {f{y)) n v{y)dy 
= a n k(-n) I Y,^y k ^y)dy 

J k>0 

= a«fc(- n )^4 n) 7fe 



and, analogously: 



7 '_ n = a- n k(n)J2^\- k , 

k>0 
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where we wrote (f(z)) n = Ylk>o c iT' (/( 2: ))° = 1j an d ^ ^ s the Fourier transform of k. 

Iterating these formulas, we obtain for a function cp G £ 2 , <p(z) = Yl n ^z n , the Fourier 
coefficients 7™ of the function L™ *ip: 

7 < m > = a n fc(-n) J2 (K=i» ns H-n s ))c^ . . . c^ lrim 



rti,...,n m >0 



for n > 0, and 



rai,...,n m >0 

for n < 0. 

To get the Fourier coefficients of the bounded continuous function J L™ fl(z)da, we 
integrate in a the Fourier coefficients of the bounded continuous functions L™A(z). In 

the above sum, all terms vanish, except the ones with n + n\ -| + n m ~\ = 0. Since all 

Hi have the same sign, the one nonzero integral corresponds to n = n\ = ■ ■ ■ = n m _i = 0. 
Claim 1 follows. QED 



Remark 4.12 Theorem holds even without randomization. Suppose that A has de- 



terminant equal to 1 and let O vary over SO(2, R). Then for almost every O the eigenvalues 
of OA are either complex with irrational argument or real and there is one eigenvalue of 
modulus bigger than one. In the first case Cesaro sums of the push forward of Lebesgue 
measure by OA§ converge to the unique invariant measure of OAp In the second case to 
the Dirac measure supported on the expanding eigenspace. Call these measures moA, then 
f moAdO is Lebesgue measure. The proof is the same, but easier. 



5 Experimental Method 

5.1 Haar Measure on SO (3) 

It is clear that an element of SO(3) is determined by its axis and angle of rotation. Here 
we describe how to pick axis and angle uniformly with respect to Haar measure on SO(3). 

Let S 2 be the usual two sphere with measure \i. Let S 1 be the usual unit circle of 
angles from to 2ir given the probability measure with density function (1 — cos#)/(27r). 
Let S 2 x S 1 be the product space with the product measure which we denote by 7. There 
is a natural map P : S 2 x S 1 — > SO (3) which maps a vector x and an angle 9 to the 
orthogonal transformation which fixes x and rotates by angle 9 around x according to the 
right hand rule. The map P sends (x, 0) to the identity in 5*0(3) for all x G S 2 and is two 
to one when 9^0, (x,6) and (—x,—9) map to the same point. P maps 7 to the Haar 
measure on 5*0(3). If we identify (x, 9) ~ (— x, —9) we obtain 5 2 x 5 1 / ~ which is a circle 
bundle over real projective 2-space. P induces a map P~ : S 2 x 5 1 / ~— > 50(3). 

Proposition 5.1 P~ is one-one off of the zero section of S 2 x 5 1 / ~ and gives a mea- 
surable isomorphism between (S 2 x 5 1 / ~,7) and (50(3), Haar). 
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Proof: That P~ is one-one off the zero section of S 2 x 5 1 / ~ is easily verified. 

Fix the standard product metric on S 2 x 5 , normalized so that each factor has total 
volume 1. Normalized Haar measure on 50(3) is Riemannian volume with respect to 
the bi-invariant metric we now describe. The Lie algebra of SO (3) is the space of anti- 
symmetric matrices so(3); on this algebra, we put the inner product: 

<A,B>=^tr(AB t ), 

where c = 2tt 2 ^ 3 . An orthonormal basis for so(3) is {X,Y, Z}, where 



X 



and 








c 




























: 


c 


o J 




V c 





: 




(Note [X, Y] = cZ, etc). In the bi-invariant metric induced by this inner product, 50(3) 



has constant sectional curvatures, see [DoC]: 



c 2 _ \\[X,Y}\\ 2 _ \\[X,Z}\\ 2 _ \\[Y,Z}\\ 2 
4 4 4 4 

The diameter of 50(3) in this metric is r = tt/c and the total volume is 

sin 




i. 



Let p(x, 9)d[x{x)d6 be the pullback of the volume form on 50(3) to S 2 x 5 1 under P. 
To prove that P~ is an isomorphism, it suffices to show that p is the density of 7, that is, 
to show that: 

p(x, 6) = (1 — cos6)/ir, 

for all (x,e) G S 2 x 5 1 . 

For any x,y 6 5 2 , if iV = A x 6 C S 2 x is a product neighborhood of (x,0) 
of Lebesgue measure (5, then there exists a g £ 50(3) such that iV = gA x © is a 
neighborhood of (y, 0) of Lebesgue measure 5. From the definition of P, it follows that 
P(N) = gP(N)g~ l . Since p(x,6)dfi(x)d9 is the pullback of an SO (3)-invariant form, we 
obtain that for any x, y £ 5 2 , 6* S 5 , 

p{x,0) = p(y,0) =:p(9). 

Finally, we compute /o(0). Since the geodesies of 50(3) through / are precisely the 
one-parameter subgroups, the image under P~ of the curve 1 1— > (p, t) is a geodesic through 
/ of speed 1/c. It follows that for any 9 G (0, tt), P~ sends 5 2 x (0, &)/ ~ diffeomorphically 
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onto B c -\q(I) \ {0}, the punctured ball of radius c 1 9 about the identity in SO(3). The 
volume of such a ball is 

r e 

p(t)dt = vol(5 c -i fl (/)) 

^ \ Vx J 

1 



o 



TT 



-(0-sin0). 



By the Fundamental Theorem of Calculus, 



p(0) = -(l-coe(0)), (9) 

TT 



which completes the proof. QED 

We call P or more appropriately P~ polar coordinates on SO(3). 

5.2 Computing Random Exponents 



Recall that R(e) = f ps2 log ||T/ e t;|| dm(v). Using the results of AvBc ] we can reduce the 



right hand integral to a one variable integral which we can then evaluate numerically very 
accurately. This is how the random Lyapunov exponents are computed. 



Proposition 5.2 

, i 

' 2 



R(e) = / log(l + (2vrex(l - x)) 2 ) dx. 
Jo 

Proof. Consider the "inverse Archimedean projection" 

^ : (9, x) i ^ (g(x) cos 6, g(x) sin 9, x — -), 



where g(x) = \/x(l — x). This map sends the cylinder C = S 1 x [0, 1] onto the sphere 
S 2 and is area-preserving: the pullback ^*dfi is a multiple of the Lebesgue volume form 
d9 dx on C. The Riemannian metric on S 2 pulls back to the metric: 

<v,w >(8,x)= v t B(x) 2 w, 

where 

B(x) = [D^ix^D^^)] 112 = ( 9{ * ] (2 0(°))-l 
Setting f £ = o / e o $, we have that: 

f e (0,x) = (9 + 2Trex,x). 
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We next compute: 

22(e) = [ log \\Tf £ v\\dm(v). 
J PS 2 

log||T/ e «|| ^dm(v). 

In the second equation, the unit tangent bundle T\C and the quantity ||T/ e u|| are defined 
with respect to the ^-pullback Riemannian metric on C. 

If v S TC is a unit vector with respect to the pullback metric, then u = Bv is a unit 
vector with respect to the Euclidean metric, and ||T/ e v|| = ||-BT/ e i? _1 ti|| Eucl . Hence we 
can write: 



R(e) = — I [ log \\BTr x 6 )f e B 1 u\\ Euch dx d6 du 

27r J(x,e)£C Juts 1 



(x,9)ec 
27r J(x,e)ec Juts 1 



/ I log || ( J ^ 7T£X (^ x ) j ,,11,. ,,,, d.rdOdu. 



For A E 5L(2,R), [|AvBo|| show that 

/ log ||Au|| Eucl . du = log((||A|| Bucl . + \\A\\~l cl )/2). 

Ju&S 1 



Applying this to A = ( ^ ) , we obtain that 



f log \\ Au\\ EucL du = I log(l + a 2 / 4). 
JueS 1 2 



It follows that: 



R(e) = J J ^ log 11 ^ q 47r£X ^ u\\ Euch dudx 

1 r 1 

= - log(l + (4tt£x(1 - x)) 2 /4)ofe 

2 Jo 

/•1/2 

= / log(l + (2vrex(l - x)) 2 ) dx. 

JO 

6 Experimental Results 

In this section we describe several numerical experiments and their results. First we obtain 
experimental values of 22(e) for random maps as introduced in Section ^. We use a method 
similar to those we use later to compute estimates for A(e). Proposition 5.2 allows to us 
to check the accuracy of the computed estimate of R(e) against the precise value of R(e) 
given there. Then we pass to the computation of A(e). Three different approaches to the 
computation of A(e) are presented and discussed which will allow us to obtain accurate 
enough values to draw conclusions. Finally, a sample of numerical estimates of A(e) is 
shown. 
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6.1 The case of random maps 



To obtain experimental values of R(s) for different e we proceed as in Proposition 2.2 . A 
random point x in S 2 and a random vector £ in T\^ X S 2 is chosen. A random sequence gi in 
50(3) is selected and the derivative of the maps gif e are applied to the tangent vector £. 
The rate of increase of the logarithm of 11^/^(^)11 is described, where f( n > = g n f £ . . . g\f £ - 
The results are the same with probability 1. For brevity, we refer loosely to the use of 
formula (Q). 

Let us describe the selection of random elements: 

• An initial point can be described in polar coordinates by a longitude X x and a 
latitude (3 X . The value of is chosen at random in [0, 2ir] with uniform probability. 
Concerning (3 X , a random value z £ [— 1, 1] is selected with uniform probability and 
then we let (3 X = sin -1 z. This gives the uniform probability for x G S 2 . 

• A tangent vector £ £ T\^ X S 2 is generated by choosing an angle ip G [0, 2ir] with 
uniform probability and letting £ make an angle t/j with the unit tangent vector 
to the latitude through x taken in the positive sense. We call this last vector the 
horizontal vector at x. It is not defined at the poles, but as the poles have measure 
zero this is irrelevant in the current context. 

• A random rotation g £ SO(3) is determined by an axis and an angle of rotation. 
As described in subsection [5,1| one can take these as elements in S 2 x S 1 . It is not 
necessary to carry out the identification described there. The axis is selected just as 
the point x £ S 2 was above. Let 6 be the rotation angle. To select it according to 
formula (||) pick a random value z £ [0, 2-7r] with uniform probability and solve the 
equation z = — s'm(6) for 9. The equation is nothing other than the well known 
Kepler equation with eccentricity equal to 1. There are efficient solvers for it. 

Then, given initial values of (x, £) one can apply formula (||) to approximate R(x, e) by 
using a finite number of iterates, N. In turn, to approximate R(e) the integral in formula 
dH) can be computed using a sample of size M in PS 2 . Let Rn,m{z) be the value obtained. 

This value is compared to the one given by Proposition |5.2| which has been computed 
using a Simpson method with iterative mesh refinement (the values of R{e) are shown in 
figure XXX). The following limit approximations are straightforward to derive: 

R(e) = fje 2 - |fj e 4 + 0(e 6 ) for e - 0, 

(10) 



R(e) = l g(2vre)-2 + -L + 0(^) for 



oo. 



2e 

Skipping the O terms one has relative errors less than 0.01 if e < 0.30 and e > 3.19, 
respectively. 
Let 

A NM {e) = R NM (e) - R(e). (11) 

We have verified experimentally that Atv,m( £ ) has essentially zero average and a standard 
deviation like 

aNM{£) ~ vm> 
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provided N is large enough. 

Tests have been done for several choices of N, M and e. Using M = 10 3 and N = 
10 fc , k = 2, . . . , 6, the values of k(s) have been estimated for e ranging from 10" 1 to 10 3 . 
There are no significant differences from k = 4 on. Figure xxx displays, for different values 
of £, the interval [— 3k(e), 3k(e)] and the results of single runs (i.e., taking M = 1) for 
N = W k , k = 4, . . . , 9. More concretely, the plotted values are the deviations Atv,i(£) 
given by formula ( |TTD multiplied by \/N. 

These results indicate that Rn,m( £ ) and R(e) agree to order 10 -5 taking NM = 10 10 . 
Further checks have been done for larger values of e (up to e = 10 6 ) which show no 
significant variation of k(s) between e = 10 3 and e = 10 6 . 

6.2 Computing the Lyapunov exponent in the deterministic case 

In principle one can follow a similar scheme to compute A(e). That is, using (|l|) with a 
finite number of iterates, N, an approximation of X\(x, g o f e ) is obtained. Then, (||) is 
computed using a Montecarlo method sampling x G S 2 as described with samples of size 
M p . Finally an estimate of A(e) is computed by applying again a Montecarlo method to 
(|3|), sampling g G SO (3) as explained above and using samples of size M r . In any case the 
samples are taken according to the appropriate measures. The total number of iterates of 
the maps and their differential is, hence, M r M p N. Let us denote by A.M r ,M p ,N( £ ) a value 
obtained in this way. 

Results of this approach are shown in figure xxx for different values of e. They require 
some explanation. For a fixed g G SO(3) the values of Xi(x,g o f e ) are estimated for M p 
random values of x. The standard deviation of the values of Ai (x, go f e ) is then computed. 
This value, a g , depends of the choice of g. Let a S 2 be the average value of a g when a full 
sample of g G SO(3) is considered. On the other hand all the M r M p determinations of 
Xi(x,g o f £ ) can be used to estimate a global standard deviation, <r total . 

It is clear that ag2 measures the average dispersion of the maximal Lyapunov exponent 
when different points are taken in the phase space. The dispersion depends on the concrete 
rotation g taken. Typically, for the g such that relatively small values of the average 

fe) °f ^1(^)5 ° fe) on S 2 are obtained, it is seen that a g is larger. It should also be 
mentioned that the errors in the determination of Ai(x, go due to the finiteness of the 
number of iterates N, also contribute to this dispersion. 

On the other hand there is also a dispersion in A(^o f £ ) when different g are taken. 
The standard deviation <7 total measures the cumulative effect of both dispersions. 

Now let us make several comments on the observed behavior, based on computations 
carried out with quite different values of M r ,M p and N. 

• The estimates of A(e) are close to R(e) for s large. For instance, for e > 3 one has 
that |A5ooo,5000,8i92(e) — R(, £ )\> m the runs done, is below 0.00325. The situation is 
worse for small e because the difference reaches the value —0.105 for e = 0.55. If 
we proceed to compute the relative error (r.e.) = (A5ooo,5000,8i92(e) — R( £ ))/R( £ ) 
the agreement is even worse. For e = 1 one has r.e. ~ —0.14 and r.e. < —0.9 if 
e < 0.42. In these comparisons one should take into account that the discrepancies 



also include the errors done in the estimates of A(e) (see Sections plfl and 6.4). But 
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the present results already indicate that the differences for e small are not only due 
to statistical errors. To make this more evident some additional computations have 
used a total number of iterates (for some selected e) with M r M p N largely exceeding 
10 12 . 

Furthermore it seems also clear that for e < 0.3 there is a "saturation" in the 
behavior of the estimates of AM r ,M p ,N( £ ) and of the standard deviations. Indeed, it 
can be seen that the trend on the figure changes completely (this is also the purpose 
to use logarithmic scales). Systematic errors occur which completely invalidate the 
statistical results. To make this more evident some values, for e in the range [0.2, 0.3], 
computed also in a probabilistic way but with a different estimator (see Section 
|6.3j ) are also shown as dots in the lower left part. For these computations M r = 
14400, M p = 16384 and N = 16384 have been used. 

The maximal value Ai jmax (e) of Xi(x,g o f £ ) for x G 5 2 and g £ 50(3) is larger 
than R(e). This requires samples with M r ,M p large to be detected if e is small. 
It will be clear from Section ^ and the upper formula in ( [To|) that the quotient 
q(e) = Ai imax (e)/.R(e) is unbounded if e — ► 0. If large values of e are considered, it is 
observed that q{e) tends slowly to 1 when e — > do. In fact it follows from the lower 
formula in ( JlOj ) and the analysis in Section |7] (which is partly valid for any e) that 
the difference Ai jmax (e) — R{e) is bounded by 2 — log(2) + 0(e _1 ). To see differences 
close to the bound one has to use very large values of M r ,M p . 

• The estimates of <r S 2 are mildly sensitive to the concrete values of M r ,M p and N, 
provided these values are not too small, and assuming 0.3 < e < 3. For e > 30 a 
clear dependence with respect to N, of the form iV -1 / 2 , is seen. From e = 3 to 
e = 30 there is a gradual increase in the dependence with respect to N. For e < 0.3 
a tendency towards a behavior of the form N , which increases when e decreases, 
is clear. 

If extremal values of a <j2 are considered when a sample of g is taken, it is clear 
that the minimum must be zero. But there is a significant difference for e < 2, 
because the minimum is already close to zero for samples of moderate size, while for 
larger e the minimum, which is almost insensitive to e goes to zero slowly when M r 
increases. On the other hand, the maximum of the observed values of a 52 increases 
until e ~ 10. It only stabilizes to a value with small dependence on e for e ~ 100. 

• The values of cx total are much larger than cxg2 for e > 1, while for e < 0.3 they are 
essentially equal. In particular they have a small dependence with respect to M r , M p 
and N, if these are not too small and assuming e > 0.3. 

An analysis of the reasons of the observed behavior is useful because it helps in three 
different aspects: a) to understand the different contributions to the errors in the estimates 
of A(e); b) to see the main differences between the cases of random and deterministic maps; 
c) to suggest alternative methods to obtain better estimates. 

• In the deterministic case the dynamics on 5 2 for a given rotation g is relevant. This 
dynamics is "destroyed" (or "smoothed" , "averaged" ) in the case of random maps. 
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Hence, the initial point is irrelevant and the estimates improve in a probabilistic 
way, depending on the total number of iterates MN, in the random case. 



Changing g in the deterministic case produces dramatic changes in the dynamics 
and, hence, on X(g o f £ ). This is specially clear for g close to the identity (axis close 
to the pole or small rotation) or for rotations of angle very close to n around an axis 
of small latitude. The variability of X(g o f £ ) with respect to g is a major source 
of dispersion in the results, specially for large e. A standard deviation <7 total around 
0.4, mainly due to the variation of \{g o f £ ), would require sampling with M r of the 
order of 10 8 , at least, to have accurate results. 

For a fixed g, changing x 6 S 2 has a very mild effect for large e. Despite the possible 
existence of tiny islands (see Section |7|) the dynamics "looks " ergodic. A similar 
behavior has been observed for standard-like maps in flGLS | and in the case of volume 



preserving flows it is seen in [ NSV ] , where an analysis of the places where the islands 



should be expected is carried out before finding them explicitly. 

Figure xxx displays a sample of orbits in S 2 for fixed g and two different values of e. 
For e = 0.3 the dynamics is mainly dominated by an integrable behavior, with many 
invariant curves and small chaotic seas, the largest one seen in the front part. This is 
persistent with respect to changes in g. The system is even more integrable (that is, 
invariant curves fill up a larger area) for most of the rotations g G SO (3). For e = 2 
(which is not so large!) only minor islands subsist, and they can even disappear for 
different g. For values like e = 10 it is hard to see any island unless g is selected on 
a set of small measure. In the random case one observes a uniform distribution of 
iterates in S 2 and the same is essentially true for large e in the deterministic case. 

• On the other hand, for fixed g and small e the value of Ai (x, g o f e ) depends strongly 
on x. But the behavior is typically rather sharp. Either one obtains a moderate 
value of the order of e 15 or it is zero. The smaller the value of e, the larger the 
measure of the x with exponent zero, of course. The average value can be very small 
and despite the standard deviation a S 2 also being small, large samples with respect 
to x have to be taken if small relative errors are desired. 

• The worst point concerning accuracy, especially for small e, are the errors in the 
computation of \\{x,go f e ). Indeed, for an integrable motion (e.g., x in an invariant 
curve) Ai is zero, but the estimates ^ log ||T, c / ri || are, generically, of the order of 

log ^ . This implies that the convergence to zero is slow. For large e this effect 
is relatively not so dramatic, but oscillations in the behavior of the quotients and 
different trends can be expected. 

6.3 Improved procedures 

Several alternative procedures have been used in previous computations to determine the 
maximal Lyapunov exponent of a given map, averaged on the phase space: 
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1) If the dynamics has a uniform hyperbolicity but with superimposed strong periodic 
or quasiperiodic oscillations, the following strategy has been used in [BS|. It consists 



in detecting, by an iterative procedure, an upper envelope of the plot of the quotients 
— log 1 1 Tn;/" || as a function of n. Then, and after skipping a transient regime, one fits 
a function of the form a + (3/n to the envelope. The value of a is a good estimator 



for Ai(x,/) and, as the system in |BS| is a skew product with linear action on the 
fibers, the value of x is irrelevant. 

2) If the values of Xi(x, f) depend strongly on x, it is possible to divide the phase space 
in pixels of a given size (in general, d-dimensional pixels) and start the computations 
at a point in each pixel. However, if the number of pixels is large and the system 
depends on several additional parameters, the method can be prohibitive from a 
computational point of view. Then, together with each initial pixel one considers 
all the pixels visited by the orbit. The estimated Lyapunov exponent is assigned to 
all of them. One requires each pixel to be visited a minimal number of times (in 
case of need one takes several initial points in the pixel) and an averaged Lyapunov 
exponent is assigned to the pixel. Later on this is averaged over the full phase space. 

This method has been used in [3S| to study the classical Hill's problem and how the 



degree of chaos behaves with respect to the energy. 

One should also take into account the stickiness of invariant curves. An initial 
point in a chaotic sea can remain close to an island for a large number of iterates. 
Hence, it is a good strategy to take a larger number of initial points even if one has 
to decrease the number of iterates for each one, provided this number is not too 
small. Furthermore, the local slope of log ||T.j;/ n || can have quite different trends 
if the number of iterates is large. Statistically this is not a problem because the 
interesting magnitude is the average behavior. 

3) The previous method still suffers from slow convergence of the quotients to Xi(x, /). 
An alternative method has been used in the context of flows with applications to 



galactic potentials in [CiSi| and later on extended to discrete transformations in 
CGS|, where references to other applications can be found. It is mainly intended to 
discriminate between regular and chaotic motion (that is, to decide if one can accept 
Xi(x, f) = or not), but it also supplies an estimate of the Lyapunov exponent. 

Given a map /, an initial point x on a manifold A4 and a random vector £ G T\ )X M, 
let £o = £ and define ^ = (Tfk-ir x \f)(£k-i)- F° r fixed integers m and n and N > 
let 

N ( U II \ _ N 

Y m , n (N) = N n Vlog -^-r k m , Y m . n (N) =Vy m , n (fc). 

Then, for a chaotic orbit the estimator 

Y m , n (N) = (ro + l)(m + n + 2) ^^ (12) 

tends to X\(x, /), while for a generic regular orbit it behaves like ^mlm+n+V) 2 ^ W ' 
The basic idea is to average the exponential rate of increase of the length of 



30 



so that the transience has small relevance and to smooth out the irregularities of 
the quotients. Hence, it is a measure of the mean exponential growth of nearby 
orbits (MEGNO) and depends on the couple (to, n). Suitable values (according to 
numerical experience) are to = 2, n = 0, and then it is denoted as MEGNO20. 

For (m,n) = (2,0) a slightly better estimator for A\(x,f) is obtained by using 
^ N^+iN^+bN 12 • Furthermore, when this method is used with these (to, n), one can 
check for a behavior of the form to decide Xi(x, f) 0. Typically Y2fi(N) — j? = 
0(N~ 2 ) for regular orbits. 

Given a maximal number of iterates iV max to be used in the estimates, an additional 
question is whether it can be better to use another value N < iV max as a better choice 



to estimate Xi(x,f). In |CGS| a "right stop" criterion is introduced. It is specially 
relevant if the orbit is close to be regular, to prevent an overestimate of Ai(x, /), but 
it has not been used in the present computations. 

Figure xxx illustrates the different behavior of MEGNO20 and the quotients in 
formula (||) in several cases. Details on the parameters used for the plots are as 
follows. 

On the upper row, left plot, for e = 0.3 a rotation of angle 9 and axis of latitude j3 with 
^ = ^g, j- = H<? where g = (V5 — 1)/2 has been selected. Random initial conditions are 
chosen. After a transient of 512 iterates, estimates of Xi(x,f) are produced and plotted 
for the next 2048 iterates. Solid lines (the lower ones) correspond to the estimates using 
MEGNO20, while discontinuous lines are produced by the classical formula. The middle 
part displays a similar plot for different initial conditions and the right part is similar to 
the middle one but for e = 10. 

On the lower row estimates of the largest Lyapunov exponent for random initial con- 
ditions are shown. On the left part e = 0.3 and 8 and j3 as before have been used, and 
512 random initial points are plotted. Solid (resp. discontinuous) lines correspond to 
MEGNO20 (resp. classical) estimators. Transient and number of iterates are as before. 
The left (middle) plot in the upper row corresponds to number 4 (43) of these points. The 
middle part is similar but for e = 3. For completeness estimates in the random case using 
finite n in (|I|) are also shown on the right for e = 0.3 . Each iterate uses a random element 
in SO(3) with density p{j3,6) = cos(/3)(l — cos(0))/(27r). The solid line shows the value of 
i?(0.3) w 0.0547518. 



See also |S2| for additional methods and applications. 

Due to the good properties of the procedure, estimates of \\{x, g o f E ) have been 
computed using procedure 3) above. For the integrations in S 2 a Montecarlo method has 
been used. This is good enough for large e, because of the mild dependence of Ai(x, g o f £ ) 
with respect to x (for, say, e > 1) for most of the i£S 2 and most of the g G SO(3). For e 
small and especially if e < 0.3, a method such as the one presented in 2) would be suitable, 
but there are additional problems, due to the smallness of the Lyapunov exponents, to be 
discussed later. Furthermore it will turn out that it is relevant to compute A(e) with a 
small relative error for large e, to allow for a careful comparison with R(e). But for e small 
it will be clear from the results, even those obtained with a moderate accuracy, that A(e) 
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is far below R(e). In any case, it seems that numerical estimates of A(e) for e < 0.3 with 
small relative error require an enormous computational effort. 

Finally, for the integrations in SO (3) and taking into account the large standard devi- 
ation found for moderate and large values of e, it has been found more convenient to use 
numerical quadrature formulas based on a grid of points. More concretely, a product Simp- 
son method has been used with respect to the latitude (5 g of the axis of rotation and the 
rotated angle g . The longitude of the axis is irrelevant: changes in this longitude are equiv- 
alent to changes in the longitude X x of x G S 2 . Using a grid with g 6 [0, 2n], (3 g £ [0, ir/2] 
requires that the estimate of X(g o f e ) be multiplied by the factor cos(/? 9 )(l — cos(# g )). 

Initial estimates for the results shown in the next Section use, for the elements in 
50(3) the following data: A = 0, (0,0) on a grid of N g x N g with N g = 200. Then a 
sample of N p = 1600 random initial points and tangent vectors in PS 1 and M = 8000 
iterates are used. The programs have been done in such a way that beyond the estimates 
for these values, also estimates using grids with N g = 100, 50 or using samples with N p = 
800, 400, 200, 100 and doing a number of iterates equal to M = 4000, 2000, 1000, 500 are 
computed. This allows for a check on the internal consistency of the results. 

It turns out that the use of different grids in SO (3) stabilizes quickly. Concerning 
the dependence with respect to N p and M, it is clearly seen that there is no need for 
very large values of N p except in the case of small e and one is interested in having small 
relative error. The dependence with respect to M is clearly of the form ctant/M. Hence, 
extrapolations with respect to M have been used. The initial estimates allow for a fine 
tuning of the most suitable values for the grid, N p and M. For instance, assuming that 
one can accept a total of 2 41 iterates (for every value of e), for large e a typical choice is 
N g = 2 8 , N p = 2 9 , M = 2 16 , while for small e it is N g = 2 7 , N p = 2 10 , M = 2 17 . Even 
with this large M the results start to be not very good if e < 0.2 . 

Figure xxx shows 3D views of h(0 g , (3 g , e) = \{gof e ) cos(/3 s )(l — cos(0 g ))^ as a function 
of (Qg, (3 g ) for different values of e. Level lines of these surfaces are displayed in figure xxx. 
The plots give a good evidence of the smooth behavior of h(0 g ,/3 g ,e) for moderate and 
large values of e, and how the behavior becomes wilder, with sharp changes for small s. 
It is clear that the results are the same if (0,13) is replaced by (2ir — 9, —(3). Furthermore, 
note that for integer e the symmetries of g o f e imply that the results should be the same 
if is changed to 2tt — 0, as clearly seen in the first three plots. For e = 0.5 the lack of 
symmetry 9 «-> 2ir — 9 is clear, but for large non-integer e this lack of symmetry is harder 
to detect. 

6.4 A sample of results 

The results of applying the methodology just described are shown in figure xxx. Typical 
values for the number of iterates, initial data and grid have been given before. 

On the top plots general views of R(e) and estimates of A(e), to be denoted as A(e) num , 
can be seen. In particular, for large and small e it is easy to check the limit behavior of 
R(e) predicted by (|To|) . On top left only values e < 10 are shown. On this scale no 
differences can be seen between R(e) and A(e) num for e > 3. For small e the differences are 
clear and they are quite dramatic for e < 0.4. 
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On the bottom left plot the tiny differences A(e) num — R(e) are displayed. It seems that 
they tend to as e — > oo, in agreement with the second possibility in Section ^. Finally the 
behavior of A(e) num for e small is seen in detail on the bottom right part. A logarithmic 
scale has been used to reveal that log(A(e) num ) is dominated by a function of the form 
— c/e for some c > 0. Due to the smallness of the estimates and to the fact that it would 
require an enormous effort to estimate A(e) for e close to 0.1 (unless other methods, from 
deterministic analysis, are used), it is not completely clear what the correct behavior is. 
Even with a reduced set of data a fit of the values obtained for 0.16 < e < 0.3 gives a 
result of the form 

log(A(e) num ) « 2.45 - ^ (13) 

which must be taken with caution, but seems to give the correct trend. This suggests 
that the inequality in Question 1.1 is not satisfied for small e, which will be confirmed 
theoretically in Section [/]. This fact is not a surprise, because similar facts occur in generic 
analytic families of area-preserving diffeomorphisms. The smallness of the Lyapunov ex- 
ponent is related to the area of the chaotic seas which in turn is related to the splitting 
of the separatrices of fixed and periodic point for maps close to the identity. See |FS], 
where general upper estimates can be found. In fact, as with many a priori exponentially 
small upper bounds, this result can also be obtained as a corollary of averaging theory for 



analytic systems, see N and also Section 7.2 



Finally it should be mentioned that some computations have been done for large values 
of £ (up to 10 6 ). Due to the strong chaotic properties it is enough to take small values (say 
N p = 256) of the number of initial points in S 2 . But the grid in the parameters (9 g ,j3 g ) 
has to contain more points. Typical values of N g to have a good determination of A(e) num 
are 2 9 and 2 10 . The results confirm what is seen in the left lower part of figure xxx, that 
is, A(e) num > R(e) and the difference goes to zero slowly. 



7 The case of small e 

As it is clear that the greatest problems occur for small e, it is worth it to carry out 
a preliminary analysis of the dynamics in this case. The first item to be studied is the 
location and stability of fixed points. This can be carried out, with the same effort, for 
any e. Furthermore this allows us to see how bifurcations give rise to new elliptic fixed 
points with the corresponding creation of islands. Later on the global behavior of g o f e 
on S 2 is discussed. In what follows it is assumed that e > 0. 



7.1 Fixed points and their stability 

To look for fixed points of gof e it is enough to consider axes of rotation with zero longitude. 
Let ((3, 6) be the latitude of the axis and the angle of the rotation. As follows from Section 



5.1, it is not restrictive to assume (3 6 [0, w/2]. Then a fixed point A is mapped by f £ to 



a point A' which by g returns to A. Let b be the latitude of A. It is clear that A and A 1 
must have symmetric longitudes, —5(b) and 5(b), respectively, where 5(b) = f e(l + sin(6)). 
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It is easy to derive the condition for the fixed points 

sin f— ^ sin(/3) cos(6) cos(<5(6)) — sin f— ^ cos(/5) sin(6)+cos f — ^ cos(6) sin(5(6)) = 0. (14) 

For e small one has sin(<5(&)) = O(e) and cos(<5(6)) = 1 — 0(e 2 )- For the analysis of this 
case it is more convenient to write ( |i~4| ) in the form 

sin (^j sin(/3 - b) - e 2 ^- sin sin(/3) cos(6)(l + sin(6)) 2 

+ e| cos f -J cos(6)(l+sin(6)) + 0(e 3 ) = 0. (15) 

From (|D|) it is clear that if sin (|) = 0(1), that is, 9 is not too close to from the 
positive or the negative side, then one has solutions for b of the form 

b = (3 + 7 e + 0(e 2 ) or b = tt + (3 + 7 e + 0(e 2 ), 

where 7 is independent of e. The value for 7 is given by 

7T 

7=^ COS (/ ? )( 1 ± Sin (/?))/ tal1 . 

where the + sign is used in the first case and the — sign in the second. In both cases b 
is close to either (3 or (3 + tt and an important thing is that there are exactly two fixed 
points for g o f e . 

Otherwise one can write 9 = mire, m > or 9 = 2ir + mire, m < 0. The dominant 
terms in (|l~5| ) become in both cases 

msin(6-/3)-cos(6)(l + sin(6)) =0. (16) 

Equation (|l6| ) has to be seen as an equation for b depending on (3 and m (which accounts 
for 9). As it has zero average it should have at least two different zeros. To look for more 
solutions it is relevant to compute the lines (in m)) where double zeros occur. The 
angle b can be used to parameterize these lines. They are represented as 

m 2 = 2 + sin 3 (b) - § sin 2 (26) , 

_ (17) 
(3 = 6-arg[sin(6)-cos(26)- v /r T(cos(6) + isin(26))] . 



As (3 6 [0, 7r/2], inspection of ( jig ) shows that no double zeros can occur in the case m > 0. 
Hence the value of m is confined to [—2, 0]. The bounds on (3 also imply that the parameter 
b in ( |l~7| ) has to be in [tt/2, 3tt/2\. It is elementary to discuss the behavior of (m,/5) as a 
function of b. It is better seen by looking at figure xxx. It can just be said that two curves 
of double zeros appear with (3 G [7r/4,7r/2]. They meet at m = —\[2, (3 = 7r/4, where a 
triple zero appears. Between both curves there are exactly four zeros. 
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When additional powers of e are included, a routine application of the Implicit Function 
Theorem permits us to conclude the same behavior for the full equation (|14|). It should 
be noted that f3 = tt/2 has to be excluded from the previous analysis: in that case the 
axis of rotation is also the axis of the twist. 

The preceding analysis can be summarized as 

Proposition 7.1 For e small enough and any g G SO(3) there are always at least two 
fixed points of go f e . Bifurcations to exactly four fixed point appear for any longitude of the 
rotation axis and for latitude of the axis and angle of rotation (/3, 9 = ir(2 — me + 0(e 2 ))) 
along two lines described by formulae ( |i~7| ) when the parameter b ranges in (tt/2,tt) and 
(ir,3ir/2), respectively. 

To discuss bifurcations of the fixed points for general values of e is an elementary 
but cumbersome task. As an illustration the case of bifurcations appearing on (3 = is 
presented. Then (|l4]) reduces to 

cos f cos(fr) sin(5(6)) - sin f sin(6) = (18) 

and the condition for a double root becomes 

— tan ( ^(1 + sin^)) = sin(6) cos 2 (fe). (19) 

The degenerate cases |6| = tt/2 must be excluded in (|i~9|). It is immediate that new double 
fixed points appear on S 2 with (3 = if and only if e is a positive integer. The number 
of double fixed points with j3 = (and some 9) increases with e. Also from (|l^) it follows 
that new zeros appear near 9 = 0, one for 9 > and the other for 9 < 0. These zeros move 
towards 9 = ir without ever reaching it. So, it is a simple matter to state how many fixed 
points exist for (3 = (except at the bifurcation values of e): there are at most 2(1 + E(e)), 
where E denotes the integer part of e. For a given non-integer e there are always values 
of 9 such that this number 2(1 + E{e)) is the exact number of fixed points. This has an 
elementary dynamical interpretation: new fixed points emanate from the north pole of S 2 
when the rotation number of f e at the north pole (defined by continuity) passes through 
(mod 1). 

To study the stability of the fixed points we recall that they are generically elliptic 
(eigenvalues fi in S 1 \ {±1}), hyperbolic (real positive eigenvalues) and hyperbolic with 
reflection (real negative eigenvalues). Let E,H,R denote the number of fixed points of 
each type. Euler-Poincare formula gives E — H + R = 2 (for simple fixed points). At the 
creation of new fixed points E and H increase by 1. When double eigenvalues are equal 
to —1 then, generically, E decreases by 1 and R increases by 1. 

An analytic discussion on the stability of the fixed points is elementary (at least for 
small e) but cumbersome. It is worth mentioning that, for any e the maximum eigenvalue 
at a fixed point is achieved on b = and has the expression 



/w = y + ( 1+ (t) ) • (20) 
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A sample of illustrations is shown in figure xxx having 9/2n as horizontal variable and 
j3/ it as vertical one. A region containing i (resp. j, k) fixed points of type E (resp. H, R) 
is denoted as R k WE % . On the top left plot and for e = 0.1 the two curves on the upper 
part of the plot are the curves of double zeros given by (|17|). Only in the region bounded 
by them there are 4 fixed points; the code is H 1 E 5 . The codes for the black, dark grey and 
light grey regions are, respectively, R 2 , E 2 and R 1 E 1 . On the top right plot, the value of 
e is ~ 3.456789. The region containing the point (0.5, 0) has exactly 2 fixed points while 
the regions which contact with this one through arcs have 4. The darker region has 6 and 
the small region near the upper right corner has 10. The regions around 8 = have 8 
fixed points. The solid lines give the location of all bifurcations and changes of stability. 

In the bottom left plot, computed for e ~ 9.876543, all the lines of bifurcation or 
change of stability are plotted. The number of fixed points, NFP, in the major regions 
is shown. The typical transitions are as follow: Consider, for instance, a passage from 
NFP= 16 to NFP= 18 near (3 = with increasing 9. First a line of creation of an elliptic 
and hyperbolic points is found. One passes from a code R 9 H 7 to R 9 H 8 E 1 . This is followed 
by a change of stability by passing to R 10 H 8 . Later on, inside the region with NFP= 20, 
the points of R type become again of type E. So the code passes, in the different changes, 
from R U H 9 to H 9 E U . 

Finally, in the bottom right plot, for s = 0.5, regions similar to the case e = 0.1 can 
be seen, with a different configuration. The level lines of figure xxx are also displayed. It 
is checked that the highest levels correspond to domains where the map has exactly one 
elliptic and one hyperbolic fixed points. This fact is also present for smaller values of e. 



7.2 The maps as perturbed twists 

The object of interest is the global dynamics of g o f £ in S 2 . To this end it is convenient 
to write these maps in a slightly different, but equivalent, way. For this study S 2 will be 
taken as the sphere of radius 1 centered at the origin. Instead of considering f e as a twist 
around the z-axis of angle ire(l + z), it will be taken as a twist around the axis of zero 
longitude and latitude (3 with angle of rotation around this axis of a point of coordinates 
(x,y,z) equal to tte(1 + xcos(P) + zsin(f3)). Then, the rotation g is simply a rotation of 

(z) 

angle 9 around the z-axis, to be denoted by R e . Up to the substitution of tt/2 — (3 for f3, 
the relative positions of the axes in this formulation and in the previous one are equivalent. 

It is instructive to first consider the case 9 = 2-7r|, where p, q are coprime integers. Let 
us introduce 5 = 7r(l + xcos(/3) + z sin (/?)). Then 

-ysin(/3) \ 
xsm(f3) - zcos([3) \+0(e 2 ). (21) 
ycos(/J) / 

(z) 

The next step is the computation of the map M q> Q t p ie := (R e o f £ ) q , the parameter j3 
being the latitude of the axis of f £ . It is clear that at order zero in e one has Mq t $ = Id. 
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An elementary computation using formula (21) for f e and the expression of 5 as a function 
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of x, z and j3, gives 



M, 



q,e,P,e 
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y 










V * / 







+ 0(e 2 



(22) 



(z) 

where R\ is now a rotation around the z-axis in each one of the horizontal planes with 
angle of rotation depending on z as follows 

7 = 7r£ ( 7(sin(/?) + zP 2 (sin(/3))), (23) 



where P2 denotes the second Legendre polynomial (P2(w) = ^w 2 — 5). 

This result tells us that the rotation R s averages the effect of the map f £ in a good 
way. Let us remark that in (|22|) it is assumed p/q fixed and e sufficiently small. From 



(23) it follows that the angle 7 is still small provided that eq is small. In the trivial case 
p = 0, q = 1 the rotation is the identity and then the twist can be be taken with (3 = vr/2, 



recovering in formula (23) the angle rotated in the twist. 

To pass to the general case for 6 one needs a preliminary lemma. 



Lemma 7.2 Let p G (0, 1) and N G N. Let 

S Pt N = (J 

l<q<N, 0<p<q, (p,g)=l 



T/ien, if N + 1 > p^ 1 one has [0, 1] C 5, 



Proof: Let a G [0, 1]. If a = r/s G Q with (r, s) = 1 and s < N, there is nothing to prove. 
Otherwise consider the approximants to a given by the continued fraction algorithm. 
Assume that ^ and ^ are consecutive approximants with qi < N and qi > N. Then 



a 



Pi 
9i 



< 



1 



< 



1 



9i92 ~ 9i(iV + l) 



< 



91 



QED 



Hence ^ can be written as | + ^, where \p\ < p, q + 1 < p^ 1 , where p is not specified 
for the moment. Hence one can represent the map g o f £ as something similar to the 
previous case, that is, a rotation whose angle is a rational multiple of 2ir, composed with 
a map close to the identity, by writing 



9° fe 



By a direct computation one obtains expression (| 
tions: 



again with the following modifica- 



If the rational which approximates 4- is 0, then q = 1 and there is no average, so 
that we keep the map g o f s , 
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• The rotation is now 7 = ireq (sin(/3) + 2p2(sm(/3))) + 2irp, 

2 

• The error terms are, uniformly in 9, of the form 0(e 2 q + 

If one takes p = e 2 / 3 then the maps (go f £ ) q are, in all cases, e 1//3 -close to the identity and 
the error terms are at most e 4 / 3 . Note that besides the choice p = e 2 ^ there are other 
possibilities, but p = e 2 / 3 is good enough to prove Corollary 7.7. Finally, it is clear that 



(|23|) , or the modification just mentioned adding 2irp, is a twist except for j3 = /3 orit such 
that P 2 (sin(/? crit )) = (/3 crit = sin" 1 (1/^3)). 
To summarize we state the following 

Proposition 7.3 If 9 is not e 2 ^ -close to zero and (3 7^ (3 crit the maps g o f £ for e small 
enough, have a power which is e 1 / 3 -close to the identity. This power satisfies a twist 
condition of order at least EP2 (sin (/?)). 

Remark 7.4 Rotations g with small 9 are irrelevant, for the present purpose, due to the 
Haar measure in SO(3), and a single exceptional case (non-twist) is also unimportant. 
This will be seen later in detail. 

Remark 7.5 In the case of small 9 it is still possible to show that g o f £ produces a twist 
effect on each meridian in S 2 . A problem which appears though is that the angle rotated by 
the different points can pass through an extremum, losing in this way the twist property. 
In fact this is not so important because the existence of invariant curves when the twist 



condition is lost at some point has been established in [Sl|. But this refinement is not 
necessary in the present context. 

Theorem 7.6 With the possible exclusion of an open set B in SO (3) of small measure, 
there exist Eo such that for e < Eo the maps g o f E have a dynamics exponentially close to 
an integrable flow in S 2 . 

Proof: The proof is divided into steps. 

1. The maps M„g g £ , being a power of g o f £ , have the same dynamics as g o f £ . In 
all cases (including q = 1 and the exceptional value of 0) they are e 1//3 -close to the 
identity. Hence there exists a suspension given by the flow of a 1-periodic vector field 
in S 2 such that the time-1 map associated to this flow coincides with M qj g : p j£ . The 
vector field is "slow" (of the order of e 1 / 3 ) and the dominant terms do not depend 



on time. See [BRS] for details and an explicit construction. 

It is relevant to note that the vector field is analytic with respect to the phase space 
variables (that is, the points in S 2 ) while the dependence in e is discontinuous in 
SO(3) (moving 9 £ [0, 2ir\ changes the value of q), but the relevant thing is that it 
is bounded in e. Furthermore the dependence with respect to t can be made of class 
C r for any r > 0, but continuity in t is sufficient for what follows. Furthermore the 
vector field is Hamiltonian. 
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2. The next step is to "average" the vector field with respect to t. This is the content 
of Neishtadt's theorem Q. See p3] for a detailed proof. As a consequence the 
vector field can be written as an autonomous part and a remainder which is expo- 
nentially small in the current small parameter; that is, the remainder is bounded 
by exp(— ce^ 1 / 3 ) for some c > 0. Furthermore, the averaged vector field is still 
Hamiltonian (see H5"V| for a sketch of the proof). 

3. As the averaged system is a Hamiltonian in S 2 , it is integrable and, hence, foliated by 
invariant curves except on the separatrices, which are a set of zero measure. Most of 
the invariant curves subsist as a consequence of Moser's twist theorem. To this end 
one should have that the perturbation is small compared with the twist condition. 
Hence, it is enough to exclude a neighborhood of the critical latitude (3 ciit which can 
be taken also exponentially small. Furthermore the set of points in S 2 not covered 
by invariant curves of the full system has a measure bounded by the square root of 
the perturbation, again exponentially small in e. 

Summarizing, when arbitrary g £ SO (3) \B are considered the dynamics in S 2 is ordered 
(the points lie on invariant curves) except for points in a subset of S 2 of exponentially 
small measure. Furthermore B consists of a neighborhood of the identity of size 0(e 2//3 ) 
and a neighborhood of /3 crit which is exponentially small in e. QED 



Corollary 7.7 For e small enough A(e) < As 3 . 

Proof: It is sufficient to make remark f^J more explicit. The differentials of the maps f £ 
increase the length of the vectors in TS 2 by a factor of the form 1 + 0(e) and composing 
with g £ SO(3) produces no essential changes in the factor. Hence the values of Ai(x, gof e ) 
are bounded by Ce, where C is a positive constant. The contribution to A(e) of the g to 
which Theorem |7.6| applies is bounded by Ce times an exponentially small amount. On 
the other hand the contribution of the excluded neighborhood of /3 crit is also exponentially 
small. 

Therefore the main contribution to A(e) can only come from the neighborhood of the 



identity excluded in Theorem 7.6. But the Haar measure of this set is of the order of 

re 2 / 3 

(1 -cos(0))d0 = 0{e 2 ). 

This bound and the previous one on Ai(x, g o f £ ) give the result. QED 

If we want to consider more "realistic" upper bounds it is possible to proceed along 



the ideas in remark 7.5. A further consideration is that the largest stochastic zones are 
typically associated to the splitting of separatrices of the hyperbolic fixed points. From 
[FS] it follows that the splitting can be bounded by 



ex P i i i 

log(/U n 
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where /x max is the maximal eigenvalue at the fixed points and c > 0. From (|2^) one has 
that for e small log(/i max ) = ^ + 0(e 2 ). This "heuristic" prediction is in good agreement 
with the observed behavior for e small. 

8 The case of large e 

Let us call the subbundle of PS 2 tangent to the invariant circles of f £ the horizontal 
bundle and denote it by H. As e — > oo, a large portion of PS 2 is sucked into a small 
neighborhood of H under f e #. The measure & g (H) 011 -PS" 2 that is atomic in each tangent 
space and supported on g(H) looks more and more like an invariant measure for gf e ,#- 
These measures integrate to give Lebesgue: 

m= / 5 g{H) dv(g). 

Jg&SO(3) 

This yields a heuristic argument for why the inequality in Question |0] should hold when 
e = 00. In this section, we make this argument rigorous by adding some <5-noise in JF £ , 
and replacing invariant measures with stationary measures. We prove: 

Theorem 8.1 Let m Et s be defined as in Section 3, and let ip E $ be the density of m e ^: 

dm £y s = (fejdm. 

There exists C > such that, for all e, 5 > 0, 

y E , s -i\\i <c5- n £-v 2 

where \\ ■ \\\ is the L l -norm with respect to Lebesgue measure m on PS 2 . 

This has the corollary: 
Corollary 8.2 There exists a C > such that for all e,5 > 0, 

\R(e, 5) - 12(e) I < C^V 1 / 2 logs. 

In particular, for all 5 > 0, 

lim \R(e,5) - 22(e) | = 0, 
where 22(e, 5) is the random diffused exponent defined in Section [3|. 

As we were finishing this paper, unpublished work of L. Carleson and T. Spencer came 
to our attention |CaSp| . For the standard map on the 2-torus: 

g £ : (x, y) 1— > (2x + e sin(27rx) — y, x) 
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where e measures the strength of the nonlinearity, they prove that by adding a noise of 
strength exp — e 2 to the element g £ , a Lyapunov exponent of order loge can be established. 



Proof of Corollary |8.2| . From the definitions, 

R(e,6) = [ log\\Tf £ v\\dm £ , 5 (v) 

J PS 2 



log \\Tf e v\\(p £jS (v) dm(v), 

PS 2 

whereas R(e) is the integral of log \\Tf e v || with respect to m. Hence, 



\R(e,5)-R(e)\ = \ log \\Tf £ v\\(<p £iS (v) - 1) dm(v)\ 
Jps 2 

< || logT/eHoo || <pe,5 ~ 1 ||l 

< C5- n e- 1/2 \oge, 



by Theorem 8.1. QED 



Proof of Theorem 8.1 



SO (3) acts transitively on T\S 2 by isometries and with trivial stabilizer. From now on 
we identify points in T\S 2 with elements of SO(3), and use the group structure in writing 
our formulas. We will use x,y,z to denote elements of PS 2 , p,q for elements of 5 2 , and 
{Pi v )i (li w ) f° r elements of PS 2 (or TiS 2 ). 

Recall from Section 3 that 



Ve,5 



Jps 2 



where tp e ,6,g is the fixed point of the operator L £ ^ g defined by: 

L £ ,s, g ^{x) = . ] f ip(y)dm(y) 
m(B(x,6)) J(gf et )-i B (x,5) 

Setting 

= rat nT ^f^)^)' 
m(B(e, o)) v ; 

where e is the identity element of SO (3), we rewrite L £ ^^ g as: 

L E ,s,g^(x) = / k s {x- l gf n (y))^{y)dm{y). (24) 
Jps 2 

Let L £j< 5 = L £> g jB . It is clear from (24) that L £> $ tg ip(x) = L £j sip(g~ 1 x) 

Denote by K £j $ : PS 2 x PS 2 — > R + the kernel of the operator L £> $, so that 

K £) s{x,y) = k s (x~ x f £ ^(y)). 
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Let 7r : PS 2 — ► S* 2 be the projection along tangent fibers. By averaging along fibers, we 
shall approximate K £) $ by a new kernel K £ ,5 that is constant along fibers of the second 
PS 2 - factor. Define K sS : PS 2 x PS 2 -> R+ by 



K EtS (x,y)= K et s(x,z)dm ny (z), 

where, for p *E S 2 , m p denotes the disintegration of m along the fiber it~ 1 p. For g G SO (3), 
we obtain a new operator L £i( 5. 9 on L°°(PS 2 ), given by: 

L e ,6, g 4>( x ) = / K £ ^(g^ 1 x,y)4>(y)dm(y). 

J PS 2 

Let L e> 5 = L e> s,e- 

The next lemma shows that the operators L £ ,8,g have a good averaging property when 
applied to densities of measures that project to Lebesgue measure (jl on S 2 . 

Lemma 8.3 Let K : PS 2 x PS 2 -> R+ 6e any L 1 function such that: 

1. for allp G S 2 , 

/ / K(x,y)dm p {x)dm{y) = 1; 

JlT^pJPS 2 

2. if ir(y) = ir(z), then for all x, K(x,y) = K(x, z). 

For (j) G L°°(PS 2 ), and g G 50(3), de/me L 9 </> G L°°(PS 2 ) by 

L g (j>(x) = K(g~ 1 x,y)(j)(y)dm(y). 
J PS 2 

Then: 

a. <p g = Lgip is the unique fixed point of L g , for any <p G L°°(PS 2 ) that satisfies: 

(p(z) dm p (z) = 1, 



I 



for allp G S 2 . 
b. for all x G PS 2 , we have: 

/ g (x)dv(g) = 1. 
JSO(3) 

Remark 8.4 Lemma \8.3j applies to the operators L E> g g . An example of a function <p that 
satisfies the hypotheses of Lemma \8. Q is the density <f £ ,5,g> f or an U £,$,9- We do not use 
that g = Lg(p g below. 
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Proof of Lemma |8.3| : Since it is constant along fibers of the second factor, K projects 
to a function on PS 2 x S 2 , which we shall also call K. 
For g £ SO (3), define <p g by: 

J P es 2 

We compute directly that, for any tp satisfying the hypotheses of a., 
L,(f)(x) = / k (r ^ y)v{y)im{y) 

JydPS 2 

/ K(g~ 1 x,p)ip(z)dm p (z)dn(p) 
pes 2 J zeir- 1 p 

K(g~ 1 x,p) dfi(p) 

p&s 2 

To see that L g ip g = (p g and finish the proof of a it is now sufficient to verify that 

<Pg(z) dm p (z) = 1. 

But g (z) = Lgl(z) = Lgl(z) since the function 1 is constant and hence constant on fibers. 
Now L g (l) is the density function of a measure on PS 2 which covers Lebesgue measure 
on S 2 and hence its integral on fibers equals 1. This proves a. 
Integrating <p g (x) with respect to g we obtain: 

/ ip g (x)du(g) = / K(g' 1 x,p)dn(p)diy(g) 

Jg&SO(3) Jg&SO(3) J p&S 2 

\ K(y,p)dfi(p)dm(y) 
i&PS 2 Jp&S 2 

= 1, 

completing the proof of b. QED 
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Returning to the proof of Theorem 8A, let ip e> s,g = L e ,6,gl = L £ ,8,g^p £ ,8,g be the unique 
fixed point of L £ ,8,g given by Lemma 8.3. We now have: 



</>£,<5-l||i = I! / {VeAg ~ l )dv{g) ||i 

Jg£SO(3) 



(Ve,8,g ~ <Pe,5,g) dv{g) \\i + \\ I {0e,S,9 ~ X ) dv (d) 

JO(3) JgdSO(3) 

< || / (L £ ,8,g<f £ ,5,g ~ L £tS ,g(p £ ,5,g)dv(g) ||i 

Jg&SO(3) 

+ II / {0e,5,g ~ ^)dv{g) ||i 

Jg&SO(3) 

= II / (L £ ,8f £ ,8,g ° g' 1 - L £)S ip £ ,5 ) gO g~ l )dv(g) 111 

j 9 eso(3) 

< / || L Et S(fe,S,g ° 5 -1 - L £> s<p £ ,5,g ° fl r_1 111 <M(?) 



where we used Lemma 3.2 to obtain the second to last inequality. 



Propositions 8J3 and 8J3, which we state and prove below, now imply that 
/ \\ L £! sf £ ,8,g° g' 1 ~ L £ ^ £ ,8.g° g" 1 \\idiy(g) < Ci<T 3 £~ 1/2 / \\<p £ ,s,g\\oo dv(g) 

Jg&SO(3) Jg&SO{3) 

< C 2 5- 3 e- 1 / 2 [ 5-*dv{g) 

JgeSO(3) 

= Cd-^e- 1 ' 2 , 



completing the proof of the theorem. It remains to state and prove Propositions 8.5 and 



8.6. 



Proposition 8.5 There is a C > such that, if e > 6 4 , then for any 4> G L°°(PS 2 ), 

WU.sU) - Lj6)h < ci^iioort- 1 / 2 . 



Proposition 8.6 There exists C > such that, for all £ > and 5 > 0, 

\\fe,8,g\\oo < C5~ 8 . 



Proof of Proposition 8.5 



\L £ A 



L £ ,s 



< 



oo\\K £) s - K £ ,s\\i 



\K E ,s(x,y) - K £tS (x,y)\dm(x)dm(y) 



\ K E,s( x >y) 



K £i s(x, z) dm ny (z)\ dm{x) dm(y). 



7r ±r jvy 



44 



Let x = (p,v),y = (q,w) be elements of PS 2 . For a fixed p,v,q, the map w i— > 
if £ <$((p, v), (g, it?)) is a constant multiple c5~ 3 of the characteristic function for ir~ 1 q n 
/ e T -B((p, «),<£). Note that, for any measurable set B in a probability space with measure 
m, the average value of the function \m(B) — 1b\ is 2m(B)(l — m(B)). It follows that: 

II K e , s -K e , s 111 = 2c<T 3 / / | 0(p,v,q) (l-(3(p,v,q)) \dm((p,v))d{i(q), 

where 

P(p,v,q) = m q {fe^B{(jp,v),5)). 

We next show that for some k > 0, and for e > <5 -4 , there is a set G C PS 2 , with 
m(G) > 1 - e~ 1/2 such that, for all (p,v) G G, there is a set G' = G'(p,v) C f £ l B(p,5) 
with //(G') > /i(-B(p, (5)) - e _1/2 , such that, for q G G' , 

P(p,v,q) < ke~ 1/2 or P(p,v,q) > 1 - ke~ 1/2 . (25) 

This implies that 

\\K £:S -K £ ,sh < 2cr 3 (fe- 1 /V(P(p,<5))+2e- 1 / 2 ) 

which implies the result. 

Fix 5 < 1/2, and assume that e > 5~ 4 . For a > 0, denote by C a the a-neighborhood 
of the horizontal bundle H C PS 12 . In other words, C Q is the set of lines in PS 2 \ 
ir~ 1 {NP, SP} at angle < a with the latitudinal circles. 

It is not difficult to see that if the distance from p to the poles is greater than e -1 / 4 , 
then 

m^if-^C^nTr^p)) > 1-e- 1 / 2 . (26) 

Let G be the set of p for which (|26|) holds, and let G = tt^Gq. Then m(G) > 1 - e^ 1 / 2 . 

Fix (p, v) G G, and consider a point f £ q G P(p, 5) Pi Go- The intersection of B((p, v),6) 
with the fiber ir~ 1 f £ q is an interval I. If the endpoints of / are disjoint from the interval 
J = C £ -i/2 PI 7r _1 / e g, then either / D J or / n J = 0. In the former case, the length of 
jgji-f is greater than the length of f £ -uJ-, which by ( |26| ) is greater than 1 — e _1//2 . In the 
latter case, the length of f £ ]jJ is less than e -1 / 2 . Hence if we let G' = G'(p,v) be the set 
of q G f~ 1 B(p, 8) n Go satisfying: 

dB((p,v),6)nC £ - 1/2 mr- 1 f £ q = 0, (27) 

then ( |25| ) holds for all g G G'. It remains to show that /u(G') > fi(B(p,8)) — e^ 1 / 2 . 

For (p,v) G PS 12 , denote by S((p,v),8) the geodesic sphere of radius 8 centered at 
(j?,v), so S((p,v),8) = dB((p,v),d). We will use the following lemma here and later in 
the proof of Proposition [Dj. 
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Lemma 8.7 There exists C > such that, for all (p,v) G vr 1 p, and all a > 0, 

(tt (S((p,t)),i)nC a )) < C5a. 
Proof. The claim follows from the following facts: 

1. On T\S 2 \tt~ 1 {NP, SP}, the subbundle H (regarded as a submanifold) is uniformly 
transverse to the fibers of PS 2 . 

2. There exists a C > such that for all 5 sufficiently small, and for all (p,v) G PS 2 , 
the intersection S((p,v), 5) Pi H is contained in a smooth curve of length < C5. 

The verification of these facts is left as an exercise. QED 
From Lemma |8.7| it follows that: 



K G> ) > KB(p,5))-^G )-^f- 1 7r(S((p,v),5)nC e - 1/2 ) 
> / i(S(p,«5))- £ - 1 /2 



This completes the proof of Proposition 3.5. QED 



Proof of Proposition 8M: We know that tp e ,5.g is a function in L 1 which satisfies, for 
all (p, v) G PS 2 , 

<Pe,6,g(p,v) = c5~ 3 / d<p €}S ,g(q,w) dfi(q)dm q (w) (28) 

Jf^B(g-i(p,v),5) 



and, for all p G S 1 



2 



<p €> 8 >g (p,v) drn p (v) = 1. (29) 



Then, by (pq), the function y e $p is continuous, and therefore has a maximum M e ^^ g 
that we denote by M. The idea is that a Holder constant for (p e ,s,g can be estimated in 



terms of M. Reporting in (29) gives a bound for M which is independent of s, g. Since we 
want to use ( p9| ) at the end, it suffices to consider the Holder constant along the fibers. 
So, let (p,v), (p,v') G n~ l p. We have: 

\<Pe Ag (p,v) - <p £ ,6,g(p,v')\ < cM5~ 3 m {f^B{g-\p,v),5) A f^B{g-\p,v'),5)), 

where m is Lebesgue measure on T\S 2 , and AAB stands for the set of points which belong 
to only one of the subsets A or B. 

Lemma 8.8 There exists C > such that, for all v,v' G Ti^S 2 , 

m {f^B{g~\p,v),5) A f^B{g-\p,v'),5)) < C5d{v,v') l '\ 
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Remark 8.9 In Proposition 3.3., we prove that (f e Sg is as smooth as f e . The point of 
the arguments here is to get a Holder constant independent of e. 

From this lemma, it then follows that M < 5(Cc) 4 <5 _8 /2, since: 

T P S 2 



/go 
(l-Cc<T 2 |t| 1/4 )+ dt 
-oo 



Secy's*. 



We now prove Lemma 18.81. 



We have the two balls B((p,v),5)) and B((p,v'), 5). Let a = \J d(v, v'). We may 
assume that a « 5. The set B((p,v),5))AB((p,v'),S) meets the fiber -K~ l q in a pair of 
intervals, each of length < a 2 « \fa. If the endpoints of these intervals do not lie in 
C aj, then the entire intervals must be disjoint from Cn^. In other words, if 

•K~ x q n (5((p, v),S)U 5((p, v'),S)) nC^ = 0, (30) 

then 

iT x q n B((p, v),5))AB((p, v'),8) nC^ = l (31) 
Let G C S 2 be the set of q satisfying <M). By Lemma O, > 1 - 2(75^"- 



Claim 8.10 There exists a C > swc/i i/tot, /or a// a < 1, p £ S 2 , and e > 0, if 
(P, v) i C^, then 

\\T( p , v )f^\ Tl , p s 2 \\ < CoT 1 . 

Proof. Recall that 1 1 ^(p.u) /^~# I tt- J-p 1 1 = H^p/fT^H -2 - With respect to the orthonormal 
basis of T p S 2 of the form {ei(p), e2(p)}, where e±(p) E H points in the direction of f £ - 
twist and e2(p) points toward the north pole NP, T p f £ takes the form: 



for some (5 > 0. A direct computation shows that there exists a constant C > such that, 
for all a, /3 > 0, if the angle between a unit vector v £ R 2 and the x-axis is greater than 
/ a;, then: 



1 -0 
1 



2 < Cor 1 



From this the claim follows. QED 

Claim |8.10| and (pl|) imply that for q € G, the derivative of / e # on 
7r _1 (7 n B((p,v),5)AB((p,v'),5) is bounded: 

\\ T (g,w)fe#\T liq S 2 \\ < a" 1 ; 
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for all w such that (q,w) G B((p, v), 5))AB((p, v'), 5). But for q G G, 



m f - lq ( /^(B((p, v),5))AB((p, v'),5) ) < a^m, ( B((p, v),S))AB((p, v'),S) ) 

< a~ l d{v,v') 



/xl/2 



But then 



m 



B((p,*;),<r))AB((py),0 < 2C 1 5a 1 ' 2 + C 2 5 2 d{v,v') 1 / 2 

< C5d{v,v') l/ \ 



completing the proof of Proposition 8^ and of Theorem p.l| . QED 

9 Discussion 



We have wondered |BuPuShWi] about the relationship of the random Lyapunov exponent 
of a measure on the space of volume preserving diffeomorphisms of a manifold to the 
mean of the Lyapunov exponents of the individual members. The point of the question 
we raised was to be able to conclude that in a rich enough family of diffeomorphisms 
there must be some with positive Lyapunov exponents, that is to say positive entropy. 
At question is what sort of notion of richness would make such a conclusion valid. We 
even proposed that much more might conceivably be true, a lower bound for the mean 
of the Lyapunov exponents in terms of the random exponents for orthogonally invariant 
measures on volume preserving diffeomorphisms of the sphere. The orthogonal invariance 
of the measure was to provide the necessary "richness". 

In the studied family strong numerical evidence has been found about the existence of 
such a lower bound when the values of the stretching parameter e are not too small. In 
some sense strong stretching has an effect similar to randomization, but it depends in a 
clear way on the concrete map. More concretely 

• Even moderate values of e like e > 10 are enough to have an average of the metric 
entropy larger than the one corresponding to the random map. 

• There exist unbounded parameters e for which islands are born. The range of exis- 
tence of these islands is small, but only the islands associated to fixed points have 
been considered. 

• For small e the estimated average entropy seems positive and definitely to be much 
less than the one of the random map. The numerical evidence is in favor of the 
existence of exponentially small lower and upper bounds (in the present example, 
with an analytic family). 

The problems in numerically estimating exponents and how to overcome them have 
been discussed. A partial analysis of the family of maps has been done for e small. Even 
a rough estimate of an upper bound of the averaged entropy is enough to show that the 
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this averaged entropy falls below any constant multiple of the entropy of the randomized 
system, if e is sufficiently small. 

Finally, the effect of a small randomization of fixed size S of the individual elements of 
the family JF £ is considered. Now the mean of the local random exponents of the family is 
indeed asymptotic to the random exponent of the entire family as e tends to infinity; that 
is, R(e, 5) and R(e) are asymptotic. 
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